Data Files

From OpenWetWare

(Difference between revisions)
Jump to: navigation, search
(Alignment)
Line 1: Line 1:
 +
[[Drosophila 16S Paper]]
 +
== Sequence files ==
== Sequence files ==
*These are the quality-trimmed reads that did not have enough overlap to assemble (thus "frags").
*These are the quality-trimmed reads that did not have enough overlap to assemble (thus "frags").

Revision as of 20:54, 15 January 2010

Drosophila 16S Paper

Contents

Sequence files

  • These are the quality-trimmed reads that did not have enough overlap to assemble (thus "frags").

Image:Fly.frags

  • These are the quality-trimmed reads that did not have enough overlap to assemble and had a significant blast hit to the "left" side of a reference 16S sequence.

Image:Fly.frags.left

  • These are the quality-trimmed reads that did not have enough overlap to assemble and had a significant blast hit to the "right" side of a reference 16S sequence.

Image:Fly.frags.right

  • These are the reads that assembled into complete clones using the JGI's 16S pipeline (genelib).

Image:Fly.contigs

  • NAST-aligned sequences from the Corby-Harris paper.

Image:Corby.NAST.aligned.fasta

  • NAST-aligned sequences from the Cox and Gilmore paper.

Image:Cox.NAST.aligned

  • A fasta file of all the Clean Chimera-checked sequenced that were unclassified at the Genus level --James Angus Chandler 19:16, 24 April 2009 (EDT)

Media:Unclassified.fasta.gz

Taxonomy Assignments

  • OLD, not quality-trimmed data: there are three files here, each corresponds to one of the three chimera-checked sequence files above (i.e., putative, sub-threshold, and clean)
  1. clean sequences Image:Classifications All NAST.Bclean.fasta30593.xls
  2. sub-threshold chimeric sequences Image:Classifications All NAST.Bambig.fasta27959.xls
  3. putative chimeric sequences Image:Classifications All NAST.Bchimera.fasta28223.xls

Alignment

  • below is the most current version of the NAST-formatted alignment file.

this is the older, not quality-trimmed file. It contains all of our sequences, plus the Corby-Harris and Cox-Gilmore sequences.

Image:All.good.gz


  • This is a concatenated alignment with the quality-trimmed data. Each half was aligned using the NAST aligner and then both halves were concatenated. There is a reference sequence in there (called testseq) that was used to decide where to end the left half and begin the right half before concatenating. This alignment does not include all of the full-length sequences that were assembled with genelib.

Image:Fly.merged.fasta

Metadata Files

Here is the environment file you asked for. Look it over to tell me if I need to add anything. I left ??? for the Cox-Gilmore samples since I cannot seem to find my copy of it and thus do not know what they collected over.

Media:MainEnvFile.xls --James Angus Chandler 20:45, 19 May 2009 (EDT)

  • This file allows the translation from JGI clone IDs to our sample IDs.

Image:May09.trans.xls

Phylogenetic Tree

CalTech Presentation

Media:CalTech_Presentation.ppt --James Angus Chandler 00:56, 11 June 2009 (EDT)

Personal tools