# Drummond:PopGen

### From OpenWetWare

(→Statistical analysis of relative growth rates) |
Current revision (22:40, 28 March 2011) (view source) (→Continuous rate of change) |
||

(One intermediate revision not shown.) | |||

Line 112: | Line 112: | ||

|<math>= s p(t)(1-p(t))\!</math> | |<math>= s p(t)(1-p(t))\!</math> | ||

|} | |} | ||

- | This result says that the proportion of type 1 <math>p</math> changes most rapidly when <math>p=0.5</math> and most slowly when <math>p</math> is very close to 0 or 1. | + | This result says that the proportion of type 1, <math>p</math>, changes most rapidly when <math>p=0.5</math> and most slowly when <math>p</math> is very close to 0 or 1. |

==Evolution is linear on a log-odds scale== | ==Evolution is linear on a log-odds scale== | ||

Line 144: | Line 144: | ||

Strains <math>i</math> and <math>j</math> have fitness <math>w_i = e^{r_i}</math> and <math>w_j=e^{r_j}</math>. Define the selection coefficient <math>s_{ij} = \ln \frac{w_i}{w_j} = r_i - r_j</math> as usual. | Strains <math>i</math> and <math>j</math> have fitness <math>w_i = e^{r_i}</math> and <math>w_j=e^{r_j}</math>. Define the selection coefficient <math>s_{ij} = \ln \frac{w_i}{w_j} = r_i - r_j</math> as usual. | ||

We have data consisting of triples (<math>g=</math>number of generations, <math>n_i=</math>number of cells of type <math>i</math>, <math>n_r=</math>number of cells of type <math>r</math>). | We have data consisting of triples (<math>g=</math>number of generations, <math>n_i=</math>number of cells of type <math>i</math>, <math>n_r=</math>number of cells of type <math>r</math>). | ||

- | We have data consisting of pairs<math> | + | We have data consisting of pairs (<math>g=</math>number of generations, <math>p_{ir}= n_i/n_r</math>) where <math>n_i</math>=number of cells of type <math>i</math> and <math>n_r=</math>number of cells of type <math>r</math>. |

What is the best estimate, and error, on <math>s_{ij}</math>? | What is the best estimate, and error, on <math>s_{ij}</math>? | ||

===Model=== | ===Model=== | ||

- | Assuming exponential growth, <math>\ln p_{ir} = | + | Assuming exponential growth, <math>\ln p_{ir} = </math> |

Let <math>\Pr(s_{ij}=t) = \mathcal{N}(t;\mu_{ij}, \sigma^2_{ij})</math>. | Let <math>\Pr(s_{ij}=t) = \mathcal{N}(t;\mu_{ij}, \sigma^2_{ij})</math>. |

## Current revision

## Introduction

Here I will treat some basic questions in population genetics. For personal reasons, I tend to include all the algebra.

## Per-generation and instantaneous growth rates

What is the relationship between per-generation growth rates and the Malthusian parameter, the instantaneous rate of growth?

Let *n*_{i}(*t*) be the number of organisms of type *i* at time *t*, and let *R* be the *per-capita reproductive rate per generation*. If *t* counts generations, then

Now we wish to move to the case where *t* is continuous and real-valued.
As before,

where the last simplification follows from L'Hôpital's rule. Explicitly, let ε = Δ*t*. Then

The solution to the equation

*t*. We can define the

*instantaneous growth rate*

*r*= ln

*R*for convenience.

## Continuous rate of change

If two organisms grow at different rates, how do their proportions in the population change over time?

Let *r*_{1} and *r*_{2} be the instantaneous rates of increase of type 1 and type 2, respectively. Then

*p*(

*t*).

This result says that the proportion of type 1, *p*, changes most rapidly when *p* = 0.5 and most slowly when *p* is very close to 0 or 1.

## Evolution is linear on a log-odds scale

The logit function , which takes , induces a more natural space for considering changes in frequencies. Rather than tracking the proportion of type 1 or 2, we instead track their log odds. In logit terms, with ,

This differential equation *L*_{p}'(*t*) = *s* has the solution

showing that the log-odds of finding type 1 changes linearly in time, increasing if *s* > 0 and decreasing if *s* < 0.

## Diffusion approximation

Insert math here.

## Statistical analysis of relative growth rates

We have three strains, *i*, *j* and *r*, where *r* is a reference strain.
Strains *i* and *j* have fitness and . Define the selection coefficient as usual.
We have data consisting of triples (*g* = number of generations, *n*_{i} = number of cells of type *i*, *n*_{r} = number of cells of type *r*).
We have data consisting of pairs (*g* = number of generations, *p*_{ir} = *n*_{i} / *n*_{r}) where *n*_{i}=number of cells of type *i* and *n*_{r} = number of cells of type *r*.

What is the best estimate, and error, on *s*_{ij}?

### Model

Assuming exponential growth, ln*p*_{ir} =

Let .

### Maximum-likelihood approach

Add text.