User:R. Eric Collins/MBL/Popgen

Population Genetics

mutation-scaled population size (theta, proportional to N_e x mu)
- hard to disentangle, large pop/small mu = small pop/large mu
- confounding between migration rate and divergence rate
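A minimal sketch of the first confounding above, assuming the common diploid scaling theta = 4 * N_e * mu (the notes do not state which scaling was used). The function name and the parameter values are made up for illustration; the point is only that sequence data inform the product, not the two factors separately.

```python
import math

def theta(n_e, mu):
    """Mutation-scaled population size, assuming the diploid scaling 4 * N_e * mu."""
    return 4 * n_e * mu

# Two hypothetical populations with very different N_e and mu.
large_pop_small_mu = theta(n_e=1_000_000, mu=1e-9)
small_pop_large_mu = theta(n_e=10_000, mu=1e-7)

# Both give the same theta (up to floating-point rounding), so data that
# depend only on theta cannot tell these scenarios apart.
same_theta = math.isclose(large_pop_small_mu, small_pop_large_mu)
print(large_pop_small_mu, small_pop_large_mu, same_theta)
```

Separating N_e from mu needs outside information, such as an independent estimate of the mutation rate.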

unless you have time-series data, don't bother with estimating population size changes through time (e.g. skyline/skyride)
thinks even a 5x5 migration model is a large model
- simplify model if possible to improve confidence/power
- "test hypotheses... don't go on fishing expeditions"

to get good estimates, you need: 1) a lot of data 2) a good computer
people with lots of data often don't run analyses long enough to guarantee convergence
F_ST and coalescence are based on same/similar assumptions so really one is not better than the other for recent divergence
the shape of population size over time can strongly affect coalescence, but you need to know how it changed and how that affects parameter estimation
- e.g. bottlenecks, recoveries, expansions, contractions
when effective population size ~ generations since divergence it can get dicey to separate divergence from migration

Felsenstein 2005: after ~10 individuals, should add another locus rather than more individuals
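One way to see why extra individuals add little, sketched under the standard single-population coalescent with constant size (an assumption here, not necessarily the argument in the cited paper): the expected time to the most recent common ancestor of n sampled lineages is 2 * (1 - 1/n) in coalescent units, so it saturates quickly as n grows. The function names are illustrative.

```python
def expected_tmrca(n):
    """Expected TMRCA of n lineages, in coalescent units where a pair coalesces at rate 1.

    While k lineages remain, the expected waiting time is 1 / C(k, 2) = 2 / (k * (k - 1)).
    """
    return sum(2.0 / (k * (k - 1)) for k in range(2, n + 1))

def fraction_of_max_depth(n):
    """Fraction of the limiting tree depth (2 coalescent units) reached by a sample of size n."""
    return expected_tmrca(n) / 2.0

for n in (2, 5, 10, 50, 100):
    print(n, round(expected_tmrca(n), 3), round(fraction_of_max_depth(n), 3))
```

Under these assumptions a sample of 10 already reaches 90% of the limiting depth, whereas each additional unlinked locus contributes an independent genealogy.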

with long times between speciation, the gene tree matches the species tree with increasing probability
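For the simplest case of three species, this can be made concrete with the standard multispecies-coalescent result that the gene tree matches the species tree with probability 1 - (2/3) * exp(-T), where T is the length of the internal species-tree branch in coalescent units. The sketch below assumes that model (one lineage sampled per species, no migration); the function name is illustrative.

```python
import math

def prob_gene_tree_matches(t_coal):
    """P(gene tree topology == species tree topology) for three species.

    t_coal: internal branch length in coalescent units (assumed scaling:
    generations divided by 2 * N_e for diploids).
    """
    return 1.0 - (2.0 / 3.0) * math.exp(-t_coal)

for t in (0.0, 0.5, 1.0, 2.0, 5.0):
    print(t, round(prob_gene_tree_matches(t), 4))
```

At T = 0 the three topologies are equally likely (probability 1/3 each), and the matching probability approaches 1 as the time between speciation events grows.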
two orderings of coalescent events lead to ((A,B),(C,D)) (either A,B or C,D coalesces first), but only one ordering each leads to (((C,D),B),A) and (((C,D),A),B)
- so symmetric trees can be overrepresented
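The counting argument above can be checked by brute force. The sketch below enumerates every ordered sequence of pairwise coalescences (every labeled history) for four lineages in a single population and tallies the resulting topologies. Representing trees as nested frozensets is an illustrative choice, not something from the notes.

```python
from collections import Counter
from itertools import combinations

def labeled_histories(lineages):
    """Yield the final tree for each ordered sequence of pairwise coalescences.

    Trees are nested frozensets, so the same topology reached through
    different event orderings compares equal.
    """
    if len(lineages) == 1:
        yield lineages[0]
        return
    for i, j in combinations(range(len(lineages)), 2):
        merged = frozenset([lineages[i], lineages[j]])
        rest = [x for k, x in enumerate(lineages) if k not in (i, j)]
        yield from labeled_histories(rest + [merged])

counts = Counter(labeled_histories(["A", "B", "C", "D"]))

balanced = frozenset([frozenset(["A", "B"]), frozenset(["C", "D"])])
caterpillar = frozenset([frozenset([frozenset(["C", "D"]), "B"]), "A"])

total = sum(counts.values())
print(total, len(counts), counts[balanced], counts[caterpillar])
```

Because labeled histories are equally likely under the coalescent, each symmetric topology gets twice the probability of each asymmetric one (2 of 18 histories versus 1 of 18).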
concatenated gene sequences are not the way to add information, can lead to statistically inconsistent results
- but with long branch lengths and lots of genes you get enough power that it's ok
- Bootstrap procedure can be positively misled in this situation

STEM: applies when the only source of variability in single-gene histories is the coalescent process

questions:
- if order generations, can follow min and max to find ancestor of all existing species
- migration as horizontal gene transfer?

"the reason to do a Bayesian analysis is not to get a tree but to get the posterior distributions"

HGT versus huge ancestral population size + long coalescent times "bacteria are special"

if there were exponential growth in a population, estimating the mutation rate assuming a constant population size will UNDERESTIMATE the instantaneous mutation rate.
given the instantaneous mutation rate and a population growth model you can estimate the past mutation rate