User:Hussein Alasadi/Notebook/stephens/2013/10/03: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 33: Line 33:
Without loss of generality, we assume that the putative selected site is site <math> j = 1 </math>
Without loss of generality, we assume that the putative selected site is site <math> j = 1 </math>


*'''Model'''
* '''Model at selected site'''
At selected site: <math> log \frac{f_{i,k,1}}{1-f_{i,k,1}} = \mu + \beta g_i + \epsilon_{i,k} </math>
<math> log \frac{f_{i,k,1}}{1-f_{i,k,1}} = \mu + \beta g_i + \epsilon_{i,k} </math>
 
* '''Model at all other sites'''





Revision as of 15:59, 13 October 2013

analyzing pooled sequenced data with selection <html><img src="/images/9/94/Report.png" border="0" /></html> Main project page
Next entry<html><img src="/images/5/5c/Resultset_next.png" border="0" /></html>

Notes from Meeting

Consider a single lineage for now.

[math]\displaystyle{ X_j }[/math] = frequency of "1" allele at SNP j in the pool (i.e. the true frequency of the 1 allele in the pool)

  • Data:

[math]\displaystyle{ (n_j^0, n_j^1) }[/math] = number of "0", "1" alleles at SNP j ([math]\displaystyle{ n_j = n_j^0 + n_j^1 }[/math])


  • Normal approximation

[math]\displaystyle{ n_j^1 }[/math] ~ [math]\displaystyle{ Bin(n_j, X_j) \approx N(n_jX_j, n_jX_j(1-X_j)) }[/math] Normal approximation to binomial

[math]\displaystyle{ \frac{n_j^1}{n_j} \approx N(X_j, \frac{X_j(1-X_j)}{n_j}) }[/math] The variance of this distribution results from error due to binomial sampling.

To simplify, we just plug in [math]\displaystyle{ \hat{X_j} = \frac{n_j^1}{n_j} }[/math] for [math]\displaystyle{ X_j }[/math]

[math]\displaystyle{ \implies \frac{n_j^1}{n_j} | X_j \approx N(X_j, \frac{\hat{X_j}(1-\hat{X_j})}{n_j}) }[/math]

  • notation

[math]\displaystyle{ f_{i,k,j} = }[/math] frequency of reference allele in group i, replicate and SNP j.

[math]\displaystyle{ \bar{f_{i,k}} = }[/math] vector of frequencies

Without loss of generality, we assume that the putative selected site is site [math]\displaystyle{ j = 1 }[/math]

  • Model at selected site

[math]\displaystyle{ log \frac{f_{i,k,1}}{1-f_{i,k,1}} = \mu + \beta g_i + \epsilon_{i,k} }[/math]

  • Model at all other sites