User:The Biology Group: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
Line 25: Line 25:
[[Media: Eye_color_3.pdf |Pigmentation Paper]] and accompanying [[Media: Pigmentation.xlsx |SNP Spreadsheet]]
[[Media: Eye_color_3.pdf |Pigmentation Paper]] and accompanying [[Media: Pigmentation.xlsx |SNP Spreadsheet]]


[[Media: Aggregate_gwas_studies.xlsx| Aggregate GWAS Studies]] (See tool portion for more information)
[[Media: Aggregate_gwas_studies.xlsx| Aggregate GWAS Studies Spreadsheet]] (See tool portion for more information)


[[Media: Sandhu2008Lancet_LDL.pdf| LDL and Cholesterol GWAS study]] and accompanying [[Media: LDL_Cholesterol_Data.xls| SNP Spreadsheet]]
[[Media: Sandhu2008Lancet_LDL.pdf| LDL and Cholesterol GWAS study]] and accompanying [[Media: LDL_Cholesterol_Data.xls| SNP Spreadsheet]]

Revision as of 23:30, 8 December 2009

Bold textHey! So in order to make it easier and more accessible, we have moved all discussion surrounding the "Biology Aspect" of our potential project to this page.

Enjoy!

Contact Information

  • Ridhi Tariyal (ridhitariyal@gmail.com)
  • Jackie Nkuebe (jnnkuebe@fas.harvard.edu)
  • Joseph Torella (jtorella@gmail.com)
  • Anugraha Raman (amraman@fas.harvard.edu)
  • Anna Turetsky (turetsky@fas.harvard.edu)

Group Description

1. Coming up with concrete examples of epistatic interactions in polygenic traits.

2. Developing a method (in conjunction with the math modeling group) to accurately predict the correlation between disease risk and SNP "hierarchy" in human disease.

3. Determining the feasibility of given models.

Final Progress

Relevant Literature

Eye color and the Prediction of Complex Phenotypes from Genotypes

Pigmentation Paper and accompanying SNP Spreadsheet

Aggregate GWAS Studies Spreadsheet (See tool portion for more information)

LDL and Cholesterol GWAS study and accompanying SNP Spreadsheet

Presentations Final Presentation

[[Media: |Intermediate Presentation]]

Tool for finding SNPs and Relevant Literature

Formulated Data Set

We attempted to get genomes of actual people accompanied by phenotypic data; however we were unable to get that. The author of the eye color study told us that if we wrote a proposal we would most likely be able to obtain this data ( See Rotterdam Proposal Section)

In order to aid the modeling group we created data sets. The data set shown below is one for eye color. Taking 20 individuals with twelve different SNPs inmportant to eye color we created a genotypic matrix. For each SNP, the individual could be homozygous dominant, heterozygous, homozygous recessive. In order to accomidate for all three cases we listed for each SNP 2 categories of homozygous dominant and heterozygous. We marked 0 if they didn't have the trait and 1 if they did have the trait. By process of elimination, or by the presence of a one we could accomadate for three possibilies using a binary system.

We then looked at the phenotypes yielded by each row. In this case blue, brown, intermediate. We then devised simple mathematical rules using all SNPs in question to come up with values. Ranges of values were then correlated with different phenotypes. The goal of the modeling team is to see if they can generate these rules using their model.

Data Set

Rotterdam Proposal and Other Outreach Efforts

Ridhi contacted Drs. Kayser and Liu, authors of a paper on eye color and the prediction of complex phenotypes from genotypes. When asked how to obtain an anonymous, yet real data set, with genotypic and corresponding phenotypic data for the purpose of testing statistical models, they responded telling us to write a proposal to the management team of the Rotterdam Study. However, they did indicate that since certain expectations from the researchers requesting the data are usually in place before such data can be given out, there is the possibility that we wouldn't be given this data set. Dr.Liu stressed that creating dummy data would not be straightforward due to [disequilibrium], and he suggested to download [[1]] data and create phenotypes based on genotypes at specific loci.

Hereis the proposal that was sent to the management team.

Ridhi also tried contacting Dr.Shriver for a real data set, but we have yet to receive a real data set.

Jacqui contacted Amy Carmargo at the Broad Institute. She works on the genotyping, sequencing and haplotype determination of [|select candidate genes]. Her paper "Association of genetic variants in KCNH2 with QT interval duration in the Framingham Heart Study" was of particular interest to us because this study had a good documentation of the SNP Genotypes and Echocardiographic Phenotypes]. We wanted to see if we could get a real data-set from this study to test our model with.

After it became clear that the next best thing to having the corresponding data sets from these studies, would actually be to download HapMap data, Jacqui was able to successfully view data on SNPs for eye color after downloading HaploView.

In class Professor Church had mentioned the problem of chromosome location standardization. Since documentation has not been standardized, different locations in different studies that correlate snps with phenotypes could actually be addressing the same chromosomal location. In order to address issues related to this we contacted Bruce Birren who works on genome-wide mapping and sequencing programs in humans and directed sequencing projects for microbes at the Broad Institute.

Future Directions and Cool Applications (Wish List)