Brandon J. Klein Electronic Lab Notebook: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
(Transferred over notes from weeks 1-3)
(added tasks for Week 4)
Line 42: Line 42:


===Week 4: September 19--September 26===
===Week 4: September 19--September 26===
'''Trajectory Update:'''
#We will focus on the 5 different "real 15-gene networks" generated last year. We will go through these input sheets and double check rates as well as formatting to make them as perfect as possible before working with them.
#We can start doing production runs. We will do this with the 5 real networks as well as random networks.
'''Tasks:'''
'''Tasks:'''
 
#Read the bar chart module in chapter 2 of the [http://petersonbiology.com/math230Notes/ R Tutorial].
 
#Begin writing a degree distribution generator script.
#Work on TRACE documentation.


==Spring 2016==
==Spring 2016==

Revision as of 18:57, 21 September 2016

Fall 2016

Week 1: August 29--September 5

Trajectory for the Semester:

  • Goal: Work towards validating the current literature network or finding the "true" network regulating the response to cold shock in S. cerevisiae.
    • Assess models of small "real networks" and compare to "random networks".

Tasks:

  1. Read GRNsight paper.
  2. Automate the creation of random networks in R.

Progress:

  1. Reviewed GRNsight paper.
  2. Investigated ways to code for the creation of random networks in R.

Week 2: September 5--September 12

Tasks:

  1. Read and consider adding the Contributor Covenant Code of Conduct to GRNmap.
  2. Develop an R script for the creation of random networks for input into GRNmap.
  3. Review the GRNmap paper published in the Bulletin of Mathematical Biology.

Progress:

  1. Read through and approved the Contributor Covenant Code of Conduct.
  2. Developed a functional script for creation of random matrices and posted it to GitHub.
  3. Reviewed GRNmap paper.

Week 3: September 12--September 19

Tasks:

  1. Make changes to the random network generator script:
    • Change terminology within the script to match GRNmap nomenclature (e.g. change "connections" to "edges").
    • Make note of the fact that GRNmap only accepts symmetric adjacency matrices.
    • Prompt users to set their working directory before outputting random matrices.
    • Refine comments within the code.
    • Add sample headers where gene names can be inserted.
  2. Work through chapter 1 of Dr. Dahlquist's recommended R tutorial.
  3. Begin work on TRACE documentation for GRNmap.

Progress:

  1. Refined and expanded the functionality of my random network generator.
  2. Worked through chapter 1 of the R tutorial.
  3. Chose sections of the TRACE documentation to work on.

Week 4: September 19--September 26

Trajectory Update:

  1. We will focus on the 5 different "real 15-gene networks" generated last year. We will go through these input sheets and double check rates as well as formatting to make them as perfect as possible before working with them.
  2. We can start doing production runs. We will do this with the 5 real networks as well as random networks.

Tasks:

  1. Read the bar chart module in chapter 2 of the R Tutorial.
  2. Begin writing a degree distribution generator script.
  3. Work on TRACE documentation.

Spring 2016

Week 1: January 15--22

Research Meeting Notes

  • Early Milestones
    1. Branch Clean-Up
    2. Organize Test Files
    3. Adjust Automated L-Curve Analysis Code
    4. Address Bugs in the Code
  • Assignments
    1. Set-Up OpenWetware User Page and Electronic Lab Notebook
    2. Alphabetize Genes in the Test Files
      • On the network sheets, use the following method: alphabetize column, transpose data, alphabetize new column, transpose back.
    3. Ensure that All Expression Data is Complete in the Test Files
      • Include Bell Data on expression and degradation rates.
    4. Address Missing Values in the Test Files
      • Highlight these cells in yellow and paste in averages.

Note: For assignments 3-4 I am to shadow Tessa and Kristen.

Progress

  • Assignment 1: Set-Up OpenWetware User Page and Electronic Lab Notebook
  • Assignments 2-4: Shadow Tessa & Kristen as they Update the Test Files
    • I met with Tessa & Kristen on Wednesday, January 20th.
      • I watched and asked questions as they made the following edits to the test files: gene names were alphabetized, expression data was completed, and missing values were addressed using the designated fix.
      • I contributed as well by introducing an Excel method that finds & highlights missing values using Conditional Formatting.
    • Method for highlighting missing values using Conditional Formatting (adapted from this forum):
      1. Select the data you would like to edit
      2. Go to Home > Conditional Formatting > New rule
      3. Click on “Format only cells that contain”
      4. Change “Cell Value” option to “Blanks”
      5. Set up formatting you want by clicking on Formatting button
        • In this case we introduce a yellow fill.
      6. Click ok.

Week 2: January 22--29

Research Meeting Notes

  • Coding Updates
    • In approx. 2 weeks, the data analysis team will use the Master branch on GitHub to access GRNmap code
    • Current input-sheet format will be the same as presently used for the Beta branch
    • Goal: be able to run models by next Friday
  • Assignments
    • Download the Beta branch and try to run a newly formatted input-sheet (has to be on a PC)
      • This can be used to identify errors (if any) are present prior to the upcoming update to the code
      • Process:
        1. Go to code in GitHub
        2. Go to the Beta branch
        3. Extract the code as a .zip file
        4. Open code in MatLab
        5. Open Input Sheet in MatLab
    • Read ecological modelling standards paper
    • Go to modelling standards web page and contribute based on your reading of the paper
    • Do research to gain a better understanding of how GRNmap works
    • Update work that was done on GitHub

Progress

Week 3: January 29--February 5

Research Meeting Notes

  • Assignments
    • Help Kristen catch up with formatting her input sheets
    • Add new formatting changes
      • See GitHub to reference 3 necessary changes to input sheets
    • Generate L-curve Analyses
      • 4 Total
        1. Largest network + deletion strain
        2. Largest network - deletion strain
        3. Smallest network + deletion strain
        4. Smallest network - deletion strain
      • Graphs will have to be plotted manually in Excel
        • LSE vs. penalty with each point's alpha value labelled
      • To bypass a temporary bug, make_graphs may have to be turned off
        • Once this function has been fixed, we will be able to do mass data generation.
        • Use Beta branch for once more week.

Progress

  • Meeting with Tessa on February 1, 2016.
    1. The 4 designated input sheets for L-curve analysis were properly formatted using the updated guidelines. Instances in which errors were triggered in GRNmap were resolved by searching for and correcting formatting errors in the input sheets.
    2. All 4 L-curve analyses were started and left running on the machines in Seaver 120 (with notes not to disturb these processes).
    3. Tessa and I went over the format of GRNmap research and the overall systems biology workflow of the Dahlquist Lab.
  • Meeting with Tessa and Kristen on February 3, 2016.
    • 4/10 L-curve analyses were complete
    • I created a command sequence in R that can be used to output labelled graphs of the L-curves.

Week 4: February 5--February 12

Research Meeting Notes

  • Goal: Complete L-curve analyses to enable selection of alpha values
    • Several runs remain incomplete, but they have completed enough iterations to produce usable data
    • Data from all runs will be compiled and formatted for R
    • The above data will be used to generate a complete set of L-curves for the 4 runs from all tested families of networks

Progress

  • Graphing L-curves
    • I helped members of the research team format their data for R and showed those with knowledge of R how to execute the command sequence to generate their L-curves
    • I personally graphed many of the L-curves
  • I compiled a Powerpoint presentation of all L-curves generated this week: Media:LcurveAnalyses 20160205.pptx

Week 5: February 12--February 19

Research Meeting Notes

  • Goals
    • Begin the Dahlquist:Microarray Data Analysis Workflow (focus: dCIN5 and WT)
      • Check normalization against Tessa's normalization data
      • Set up a meeting with Maggie to go through this workflow in parallel
      • Account for missing values during ANOVA testing if time permits
    • Create a script for generating parameter plots in R (for different alpha values) if time permits

Progress

  • Began the Microarray Data Analysis Workflow
  • Updated the R command sequence used to generate L-curves
    • This sequence was scripted to automatically generate L-curves
    • The script is interactive, prompting users to input the pathway to the data and provide a name for the graph
    • An updated description can be found here: Graphing L-Curves in R

Week 6: February 19--February 26

Research Meeting Notes Next Meeting: Friday February 26 (2:30-3:30)

  • Goals:
    • Continue the Dahlquist:Microarray Data Analysis Workflow with Maggie and cross-check results
    • Write a script in R to convert adjacency matrices to edge lists
    • If time permits...
      • Account for missing values during ANOVA testing
      • Create a script for generating parameter plots in R

Progress


Important Links