Dahlquist:Notebook/Microarray Data Analysis/2008/10/21: Difference between revisions
From OpenWetWare
Jump to navigationJump to search
Line 22: | Line 22: | ||
*** wt-vs-dCIN5_Edge_covariates_20081021.txt | *** wt-vs-dCIN5_Edge_covariates_20081021.txt | ||
''' Reran the dCIN5-vs-wt data with the updated covariate file:''' | |||
* | * Gene file in Desktop "Data analysis 2008-10-02", Covariate file on Desktop | ||
** Used gene file "wt-dCIN5_consolidated_Edge_genes-indexonly_20080715.txt" | ** Used gene file "wt-dCIN5_consolidated_Edge_genes-indexonly_20080715.txt" | ||
** Used covariate file "wt- | ** Used covariate file "wt-dCIN5_consolidated_Edge_covariates_20081021.txt | ||
* Load both into an Edge session. | * Load both into an Edge session. | ||
* Select "Impute Missing Data" from the menu. Calculate Percent Missing Data by clicking on the button. The results are: | * Select "Impute Missing Data" from the menu. Calculate Percent Missing Data by clicking on the button. The results are: | ||
Line 37: | Line 37: | ||
* Selected "Identify Differentially Expressed Genes" | * Selected "Identify Differentially Expressed Genes" | ||
** Class Variable is: Strain | ** Class Variable is: Strain | ||
** Differential Expression Type is: | ** Differential Expression Type is: Time Course | ||
** Number of null iterations, set to 1000 | ** Number of null iterations, set to 1000 | ||
** Choose a seed for reproducible results, set to 47 | ** Choose a seed for reproducible results, set to 47 | ||
** click "Go" | ** Choose Time Course Settings | ||
** 1000 permutations looks like it will take about | ** Covariate giving time points is: Timepoint | ||
** Covariate corresponding to individuals is: Flask | |||
** Choose spline type, accepted default of Natural Cubic Spline, dimension 4 | |||
* Results: (Saved in ) | ** Click "Apply" and then click "Go" | ||
** | ** 1000 permutations looks like it will take about 9 minutes. | ||
* Results: (Saved in 2008-10-14 Results) | |||
** No significant genes under these settings. | |||
** Choose Q-Value cutoff as 1, recalculate | ** Choose Q-Value cutoff as 1, recalculate | ||
*** Saved total list of genes as: "" | *** Saved total list of genes as: "GeneList_20081014_wt-vs-dCIN5" | ||
** To save the plots, do the following command in the R console window. | ** To save the plots, do the following command in the R console window. | ||
savePlot(filename = "PvalHistogram_wt-vs-dCIN5", type = c("png"), device = dev.cur()) | savePlot(filename = "PvalHistogram_wt-vs-dCIN5", type = c("png"), device = dev.cur()) | ||
* This will save the active plot window under a file name you choose. Saves in folder "edge_1.1.290" | * This will save the active plot window under a file name you choose. Saves in folder "edge_1.1.290" | ||
** Saved Q-Plot as "" | ** Saved Q-Plot as "QPlot_20081014_wt-vs-dCIN5" | ||
** Saved Histograms as " | ** Saved Histograms as "PvalHistogram_20081014_wt-vs-dCIN5 | ||
'''Then dCIN5 dataset was ran on its own:''' | |||
* Files in "Edge_data_20080710" | |||
** Used gene file "dCIN5-only_Edge_genes-indexonly_20080715.txt" | |||
** Used covariate file "dCIN5-only_Edge_covariates_20080710.txt" | |||
* Load both into an Edge session. | |||
* Select "Impute Missing Data" from the menu. Calculate Percent Missing Data by clicking on the button. The results are: | |||
** Percent of genes missing data: 1.32% | |||
** Percent of arrays missing data: 90% | |||
** Overall percent of missing data: 0.09% | |||
* For KNN Parameters, set: | |||
** Percent of missing values to tolerate in a gene: 100 (so all genes included) | |||
** Number of nearest neighbors to use (maximum of 15): 15 | |||
** clicked GO to impute missing data. | |||
* Selected "Identify Differentially Expressed Genes" | |||
** Class Variable is: None (within class differential expression) | |||
** Differential Expression Type is: Time Course | |||
** Number of null iterations, set to 1000 | |||
** Choose a seed for reproducible results, set to 47 | |||
** Choose Time Course Settings | |||
** Covariate giving time points is: Timepoint | |||
** Covariate corresponding to individuals is: Flask | |||
** Choose spline type, accepted default of Natural Cubic Spline, dimension 4 | |||
** Click "Apply" and then click "Go" | |||
** 1000 permutations looks like it will take about 2 minutes. | |||
* Results: (Saved in 2008-10-14 Results) | |||
** 157 Genes Called Significant (Cutoff Q Value 0.1) | |||
** Saved total list of genes as "GeneList_20081014_dCIN5-only" | |||
** Saved Q-Plot as "QPlot_20081014_dCIN5-only" | |||
** Saved Histograms as "PvalHistogram_20081014_dCIN5-only" | |||
Revision as of 13:54, 21 October 2008
Microarray Data Analysis | <html><img src="/images/9/94/Report.png" border="0" /></html> Main project page <html><img src="/images/c/c3/Resultset_previous.png" border="0" /></html>Previous entry<html> </html>Next entry<html><img src="/images/5/5c/Resultset_next.png" border="0" /></html> |
Today's WorkflowThe results generated on 10/14/2008 were downloaded and placed on the Desktop in "Edge Analysis" in Kevin's profile. Significant gene results were saved as tab-delimited files and the Pvalue Histograms and QPlots were saved into a powerpoint and printed.
Previous run (10/14/2008) on dCIN5-only dataset gave interesting results. While the wt-only dataset produced about 1000 significant genes, the dCIN5-only one gave about 150 significant genes. To verify this result:
Reran the dCIN5-vs-wt data with the updated covariate file:
savePlot(filename = "PvalHistogram_wt-vs-dCIN5", type = c("png"), device = dev.cur())
Then dCIN5 dataset was ran on its own:
|