User:The Biology Group: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 41: Line 41:
'''Rotterdam Proposal'''
'''Rotterdam Proposal'''


Ridhi contacted Dr. Kaysers and Liu, authors of a paper on [[Media: Liu_Paper.pdf|eye color and the prediction of complex phenotypes from genotypes]]. When asked for how to obtain an anoynomous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models on, they responded telling us to write a proposal to the management team of the Rotterdam Study.  However, they did indicate that since certain expectations are usually in place when such data is given out, there is the possibility that we wouldn't be given this data set.  Dr.Liu stressed that creating dummy data would not be straightforward due to [[http://en.wikipedia.org/wiki/Linkage_disequilibrium|linkage disequilibrium]], and he suggested to download HapMap data and create phenotypes based on genotypes at specific loci.
Ridhi contacted Drs. Kayser and Liu, authors of a paper on [[Media: Liu_Paper.pdf|eye color and the prediction of complex phenotypes from genotypes]]. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study.  However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set.  Dr.Liu stressed that creating dummy data would not be straightforward due to [[http://en.wikipedia.org/wiki/Linkage_disequilibrium|linkage disequilibrium]], and he suggested to download [[http://hapmap.ncbi.nlm.nih.gov|HapMap]] data and create phenotypes based on genotypes at specific loci.


Below is the proposal that was sent to the management team:
Below is the proposal that was sent to the management team:


[[Media: Proposal_for_Rotterdam_Data.doc | Proposal]]
[[Media: Proposal_for_Rotterdam_Data.doc | Proposal]]

Revision as of 21:19, 8 December 2009

Bold textHey! So in order to make it easier and more accessible, we have moved all discussion surrounding the "Biology Aspect" of our potential project to this page.

Enjoy!

Contact Information

  • Ridhi Tariyal (ridhitariyal@gmail.com)
  • Jackie Nkuebe (jnnkuebe@fas.harvard.edu)
  • Joseph Torella (jtorella@gmail.com)
  • Anugraha Raman (amraman@fas.harvard.edu)
  • Anna Turetsky (turetsky@fas.harvard.edu)

Group Description

1. Coming up with concrete examples of epistatic interactions in polygenic traits.

2. Developing a method (in conjunction with the math modeling group) to accurately predict the correlation between disease risk and SNP "hierarchy" in human disease.

3. Determining the feasibility of given models.

Final Progress

Relevant Literature

Pigmentation Paper and accompanying SNP Spreadsheet

Aggregate GWAS Studies (See tool portion for more information)

Presentations

Tool for finding SNPs and Relevant Literature

Formulated Data Set

We attempted to get genomes of actual people accompanied by phenotypic data; however we were unable to get that. The author of the eye color study told us that if we wrote a proposal we would most likely be able to obtain this data ( See Rotterdam Proposal Section)

In order to aid the modeling group we created data sets. The data set shown below is one for eye color. Taking 20 individuals with twelve different SNPs inmportant to eye color we created a genotypic matrix. For each SNP, the individual could be homozygous dominant, heterozygous, homozygous recessive. In order to accomidate for all three cases we listed for each SNP 2 categories of homozygous dominant and heterozygous. We marked 0 if they didn't have the trait and 1 if they did have the trait. By process of elimination, or by the presence of a one we could accomadate for three possibilies using a binary system.

We then looked at the phenotypes yielded by each row. In this case blue, brown, intermediate. We then devised simple mathematical rules using all SNPs in question to come up with values. Ranges of values were then correlated with different phenotypes. The goal of the modeling team is to see if they can generate these rules using their model.

Data Set

Rotterdam Proposal

Ridhi contacted Drs. Kayser and Liu, authors of a paper on eye color and the prediction of complex phenotypes from genotypes. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study. However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set. Dr.Liu stressed that creating dummy data would not be straightforward due to [disequilibrium], and he suggested to download [[1]] data and create phenotypes based on genotypes at specific loci.

Below is the proposal that was sent to the management team:

Proposal