User:The Biology Group: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 45: Line 45:
Ridhi contacted Drs. Kayser and Liu, authors of a paper on [[Media: Liu_Paper.pdf|eye color and the prediction of complex phenotypes from genotypes]]. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study.  However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set.  Dr.Liu stressed that creating dummy data would not be straightforward due to [[http://en.wikipedia.org/wiki/Linkage_disequilibrium|linkage disequilibrium]], and he suggested to download [[http://hapmap.ncbi.nlm.nih.gov|HapMap]] data and create phenotypes based on genotypes at specific loci.
Ridhi contacted Drs. Kayser and Liu, authors of a paper on [[Media: Liu_Paper.pdf|eye color and the prediction of complex phenotypes from genotypes]]. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study.  However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set.  Dr.Liu stressed that creating dummy data would not be straightforward due to [[http://en.wikipedia.org/wiki/Linkage_disequilibrium|linkage disequilibrium]], and he suggested to download [[http://hapmap.ncbi.nlm.nih.gov|HapMap]] data and create phenotypes based on genotypes at specific loci.


Below is the proposal that was sent to the management team:
[[Media: Proposal_for_Rotterdam_Data.doc | Here]]is the proposal that was sent to the management team:
 
[[Media: Proposal_for_Rotterdam_Data.doc | Proposal]]


Ridhi also tried contacting Dr.Shriver for a real data set, but we have yet to receive a real data set.
Ridhi also tried contacting Dr.Shriver for a real data set, but we have yet to receive a real data set.

Revision as of 22:50, 8 December 2009

Bold textHey! So in order to make it easier and more accessible, we have moved all discussion surrounding the "Biology Aspect" of our potential project to this page.

Enjoy!

Contact Information

  • Ridhi Tariyal (ridhitariyal@gmail.com)
  • Jackie Nkuebe (jnnkuebe@fas.harvard.edu)
  • Joseph Torella (jtorella@gmail.com)
  • Anugraha Raman (amraman@fas.harvard.edu)
  • Anna Turetsky (turetsky@fas.harvard.edu)

Group Description

1. Coming up with concrete examples of epistatic interactions in polygenic traits.

2. Developing a method (in conjunction with the math modeling group) to accurately predict the correlation between disease risk and SNP "hierarchy" in human disease.

3. Determining the feasibility of given models.

Final Progress

Relevant Literature

Eye color and the Prediction of Complex Phenotypes from Genotypes

Pigmentation Paper and accompanying SNP Spreadsheet

Aggregate GWAS Studies (See tool portion for more information)

Presentations

Tool for finding SNPs and Relevant Literature

Formulated Data Set

We attempted to get genomes of actual people accompanied by phenotypic data; however we were unable to get that. The author of the eye color study told us that if we wrote a proposal we would most likely be able to obtain this data ( See Rotterdam Proposal Section)

In order to aid the modeling group we created data sets. The data set shown below is one for eye color. Taking 20 individuals with twelve different SNPs inmportant to eye color we created a genotypic matrix. For each SNP, the individual could be homozygous dominant, heterozygous, homozygous recessive. In order to accomidate for all three cases we listed for each SNP 2 categories of homozygous dominant and heterozygous. We marked 0 if they didn't have the trait and 1 if they did have the trait. By process of elimination, or by the presence of a one we could accomadate for three possibilies using a binary system.

We then looked at the phenotypes yielded by each row. In this case blue, brown, intermediate. We then devised simple mathematical rules using all SNPs in question to come up with values. Ranges of values were then correlated with different phenotypes. The goal of the modeling team is to see if they can generate these rules using their model.

Data Set

Rotterdam Proposal and Other Efforts to Obtain Real Data Sets

Ridhi contacted Drs. Kayser and Liu, authors of a paper on eye color and the prediction of complex phenotypes from genotypes. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study. However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set. Dr.Liu stressed that creating dummy data would not be straightforward due to [disequilibrium], and he suggested to download [[1]] data and create phenotypes based on genotypes at specific loci.

Hereis the proposal that was sent to the management team:

Ridhi also tried contacting Dr.Shriver for a real data set, but we have yet to receive a real data set.

Jacqui contacted Amy Carmargo at the Broad Institute. She works on the genotyping, sequencing and haplotype determination of [|select candidate genes]. Her paper "Association of genetic variants in KCNH2 with QT interval duration in the Framingham Heart Study" was of particular interest to us because this study had a good documentation of the SNP Genotypes and Echocardiographic Phenotypes]. We wanted to see if we could get a real data-set from this study to test our model with.

After it became clear that the next best thing to having the corresponding data sets from these studies, would actually be to download HapMap data, Jacqui was able to successfully view data on SNPs for eye color after downloading HaploView.