20.109(S07): Start-up expression engineering

From OpenWetWare
Revision as of 08:58, 30 December 2006 by Nkuldell (talk | contribs)
Jump to navigationJump to search


20.109: Laboratory Fundamentals of Biological Engineering

Home        People        Schedule Spring 2007        Lab Basics        OWW Basics       
Genome Engineering        Biophysical Signal Measurement        Expression Engineering        Biomaterial Engineering       

Introduction

From your work so far this term, you have a good understanding of (at least) two fundamental concepts. From our first experimental module it should be clear that the genetic program for running a cell is readable (through sequencing), writable (through molecular biological techniques and synthesis) and somewhat, though not perfectly understandable. Recall how a genetic part can be as small as 13 base pairs, BBa_B0032 for example, to help carry out the work of protein production by enabling a huge protein/RNA complex, the ribosome, to bind an RNA molecule. From our second experimental module, it should be clear that a cell's programming doesn't end at protein production but rather that proteins are dynamic (chemically and spatially). They react to changes in the envirnoment with great speed and sensitivity. We saw that proteins could be considered "digital" information, either present or absent, but are perhaps better thought of as "tunable" analog data since, as far as the cell is concerned, their relevance is their function, regulated by stability, localization and modification. Thus, from the work we've done so far this term, you may have the idea that gene expression in a cell is powered by the central dogma (DNA making RNA making protein) and then modulated by varying the properties of proteins once they are made. This strategy minimizes reaction times but is, energetically speaking, quite wasteful. Why make a protein if it's not useful? In this experimental module we'll see how nature has refined the genetic programming language so a cell's proteins are not all constitutively expressed but rather are finely regulated at the level of transcription initiation. This turns out to be only one of many points for regulation but it's an important one.

30 nm chromatin fiber, from Robinson et al PNAS 2006


At the same time we'll see how nature has overcome another design issue, namely space constraints. A cell's dimensions do not increase linearly with DNA content. Instead, eukaryotic cells remain compartmentalized and compact the DNA by wrapping it around assemblies of histone proteins called nucleosomes. Nucleosomes wrap around eachother to form chromatin. This solves the space issue, allowing 3 feet of DNA to be crammed into a space perhaps 10 um across, but creates a new problem. Wrapped DNA is less accessible to the transcription and replication machinery. Gene expression becomes newly and intimately related to chromatin dynamics. This new problem is overcome by other multiprotein complexes that interact with the DNA-wrapping proteins. Nucleosomes are redistributed around genes that are "active" though it remains unclear if this redistribution is a cause or a consequence of the activity. We'll study one chromatin-remodeling complex called SAGA in this experimental module.

a model for SAGA's interaction with transcription factors and nucleosome-bound DNA, image courtesy of P. Schultz


The name "SAGA" is an acronym for "Spt-Ada-Gcn5-acetyltransferase." Before describing each of these components, it's important to note that biochemically similar complexes are found in many (all?) of the eukaryotic cells that have been studied. Even more remarkable, these SAGA complexes have similar (identical?) numbers of protein subunits and the proteins have noteable sequence homologies, suggesting conserved functions even in cells with diverse life-styles like yeast and human cells. Natural processes like development and division and disease states like cancer can be understood at the level of transcription (mis)regulation. Chromatin remodeling is required for appropriate gene expression which is, in turn, required for healthy cell behaviors. Thus, there is good reason to believe that an understanding of how SAGA works in yeast can give us insight into its role in cells more medically relevant, like human.


A combination of biochemical and genetic data initially suggested that an enzymatic activity, namely a histone-acetyl transferase, encoded by the GCN5 gene in S. cerevisiae existed as a large protein complex called that the authors named "SAGA" [Grant et al 1997]. Nineteen proteins, including GCN5, associate to form SAGA, though surprisingly not all the proteins are absolutely required for the cell to live. Delete a nonessential subunit and the cell can still grow and divide, although sometimes with impaired functions. A table of all 19 SAGA subunits, including information about their essential or non-essential nature, is included as part of today's protocol. A recent structure for the complex was elucidated through electron microscopy [Wu et al 2004]. The SAGA structure, as shown above, can be imagined to "dock" with the DNA and associated proteins, allowing for some very elegant models of how chromatin-modifications are performed and regulated. Many terrific and exciting experiments can be imagined to test these models. Your experiment will be a systematic examination of the non-essential subunits and their role in gene expression. Today you will choose one of the nonessential subunits to delete from the yeast genome, and design some primers to generate a null mutant. Later in this experimental module, you will examine your mutated strain for changes in gene expression, looking for new phenotypes associated with the SAGA-subunit deletion as well as looking by DNA microarray for genes whose expression is affected by the loss of this subunit.

Protocols

Part 1: Choosing a SAGA subunit

You should begin by learning a little about the different classes of SAGA subunits. What does "Spt" stand for (and how do you pronounce that?!)? What is a TAF? Why is Ada5 also called Spt20?
The nomenclature for S. cerevisiae is precise and helpful. Wild type genes are normally given an italicized, three letter acronym based on the phenotype of a mutation in that gene. So a HIS gene is unable to make histidine if the gene is defective (Of course, dead cells all have only one phenotype so this presumes loss of the gene product doesn't kill the cell...). Since there exist several genes that can give rise to similar phenotypes, associated genes are given a number, e.g. HIS3, HIS4,, etc. To describe recessive mutant alleles, lower case letters are used. So a strain that is his3 has a mutation that affects the function of the HIS3 gene. Since there can be several different mutations described for any given gene, a second number gets associated with the mutant, e.g. his3-1 or his3delta200. Proteins are distinguished from DNA by capitalizing only the first letter of the gene product: the HIS3 gene makes the His3 protein. Naturally there are exceptions to these rules, but in general you can pretty confidently follow them.

Begin aquainting yourself with the SAGA subunits by copying the following table to your wiki userpage, then finding relevant information in the Saccharomyces Genome Database for the subunits. You can also search Pubmed to find out a little more about the role of the subunits in gene expression. Do not shortchange yourself on this part of the experiment, since you will be working with the subunit you choose today for the rest of the module.

SAGA subunits, S. cerevisiae

Ada subunits size,chromosome,null p-type notes
Ada1 (aka HFI1, SUP110, SRM12, GAN1) 1.467 kb/489 aa, Chr. XVI,
viable
Ada2 (aka SWI8) 1.305 kb/434aa, Chr. IV,
viable
Ada3(aka NGG1, SWI7) 2.109 kb/702aa, Chr. IV,
viable
Gcn5 (aka ADA4, SWI9) 1.32 kb/439aa, Chr. VII,
viable
Ada5 (aka SPT20) 1.815 kb/604aa, Chr. XV,
viable
Spt subunits size, chromosome, null p-type notes
Spt3 1.014 kb/337aa, Chr. IV,
viable
Spt7(aka GIT2) 3.999 kb/1332aa, Chr. II,
viable
Spt8 1.809 kb/602aa, Chr. XII,
viable
Spt20 (aka Ada5) 1.815 kb/604aa, Chr. XV,
viable
TAF subunits size, chromosome, null p-type
TAF5 (aka TAF90) 2.397 kb/798aa, Chr. II, inviable
TAF6 (aka TAF60) 1.551 kb/516aa, Chr. VII, inviable
TAF9 (aka TAF17) 0.474 kb/157aa, Chr. XIII, inviable
TAF10 (aka TAF23, TAF25) 0.621 kb/206aa, Chr. IV, inviable
TAF12(aka TAF61, TAF68) 1.620 kb/539aa, Chr. IV, inviable
Tra1 subunit size, chromosome, null p-type notes
Tra1 11.235 kb/3744aa, Chr. VIII, inviable
other subunits size, chromosome, null p-type notes
Sgf73 1.974 kb/657aa, Chr. VII ,
viable
Sgf29 0.779 kb/259aa, Chr. III,
viable
Sgf11 0.3 kb/99aa, Chr.XVI,
viable
Ubp8 1.416 kb/471aa, Chr. XIII,
viable
Sus1 gene with intron, Chr. II,
viable

Part 2: Designing deletion oligos

Part 3: PCR

  1. pRS406 template

DONE!

For next time

Reagents list