20.109(S07): Start-up expression engineering

From OpenWetWare
Revision as of 09:55, 30 December 2006 by Nkuldell (talk | contribs)
Jump to navigationJump to search


20.109: Laboratory Fundamentals of Biological Engineering

Home        People        Schedule Spring 2007        Lab Basics        OWW Basics       
Genome Engineering        Biophysical Signal Measurement        Expression Engineering        Biomaterial Engineering       

Introduction

From your work so far this term, you have a good understanding of (at least) two fundamental concepts. From our first experimental module it should be clear that the genetic program for running a cell is readable (through sequencing), writable (through molecular biological techniques and synthesis) and somewhat, though not perfectly understandable. Recall how a genetic part can be as small as 13 base pairs, BBa_B0032 for example, to help carry out the work of protein production by enabling a huge protein/RNA complex, the ribosome, to bind an RNA molecule. From our second experimental module, it should be clear that a cell's programming doesn't end at protein production but rather that proteins are dynamic (chemically and spatially). They react to changes in the environment with great speed and sensitivity. We saw that proteins could be considered "digital" information, either present or absent, but are perhaps better thought of as "tunable" analog data since, as far as the cell is concerned, proteins are relevant only when functional, and function can be governed by a regulating protein stability, localization and modifications. Thus, from the work we've done so far this term, you may have the idea that gene expression in a cell is powered by the central dogma (DNA making RNA making protein) and then modulated by varying the properties of proteins once they are made. This strategy minimizes reaction times but is, energetically speaking, quite wasteful. Why make a protein if it's not useful? In this experimental module we'll see how nature has refined the genetic programming language so a cell's proteins are not all constitutively expressed but rather are finely regulated at the level of transcription initiation. This turns out to be only one of many points for regulation but it's an important one.

30 nm chromatin fiber, from Robinson et al PNAS 2006

In the upcoming experiment we'll also see how nature has overcome another design issue, namely space constraints. A cell's dimensions do not increase linearly with DNA content. Instead, eukaryotic cells remain compartmentalized and compact the DNA by wrapping it around assemblies of histone proteins called nucleosomes. Nucleosomes wrap around eachother to form chromatin. This solves the space issue, allowing a meter or so of DNA to be crammed into a space perhaps 10 um across, but creates a new problem. Wrapped DNA is less accessible to the transcription and replication machinery. Gene expression becomes newly and intimately related to chromatin dynamics. This new problem is overcome by other multiprotein complexes that interact with the DNA-wrapping proteins. Nucleosomes are redistributed around genes that are "active" though it remains unclear if this redistribution is a cause or a consequence of the activity. We'll study one chromatin-remodeling complex called SAGA in this experimental module.

a model for SAGA's interaction with transcription factors and nucleosome-bound DNA, image courtesy of P. Schultz


The name "SAGA" is an acronym for "Spt-Ada-Gcn5-acetyltransferase." Before describing each of these components, it's important to note that biochemically similar complexes are found in many (all?) of the eukaryotic cells that have been studied. Even more remarkable, these SAGA complexes have similar (identical?) numbers of protein subunits and the proteins have noteable sequence homologies, suggesting conserved functions even in cells with diverse life-styles like yeast and human cells. Natural processes like development and division and disease states like cancer can be understood at the level of transcription (mis)regulation. Chromatin remodeling is required for appropriate gene expression which is, in turn, required for healthy cell behaviors. Thus, there is good reason to believe that an understanding of how SAGA works in yeast can give us insight into its role in cells more medically relevant, like human.


A combination of biochemical and genetic data initially suggested that an enzymatic activity, namely a histone-acetyl transferase, encoded by the GCN5 gene in S. cerevisiae existed as a large protein complex called that the authors named "SAGA" [Grant et al 1997]. Nineteen proteins, including GCN5, associate to form SAGA, though surprisingly not all the proteins are absolutely required for the cell to live. Delete a nonessential subunit and the cell can still grow and divide, although sometimes with impaired functions. A table of all 19 SAGA subunits, including information about their essential or non-essential nature, is included as part of today's protocol. A recent structure for the complex was elucidated through electron microscopy [Wu et al 2004]. The SAGA structure, as shown above, can be imagined to "dock" with the DNA and associated proteins, allowing us to imagine some very elegant models for how chromatin-modifications might be performed and regulated. Many terrific and exciting experiments can be designed to test these models. Your experiment will be a systematic examination of the non-essential SAGA subunits and their role in gene expression. Today you will design some primers to delete from the yeast genome a nonessential subunit of your choosing. Later in this experimental module, you will examine your mutated strain for changes in gene expression, looking for new phenotypes associated with the SAGA-subunit deletion as well as looking by DNA microarray for genes whose expression is affected by the loss of this subunit.

Protocols

Part 1: Choosing a SAGA subunit

You should begin by learning a little about the different classes of SAGA subunits. What does "Spt" stand for (and how do you pronounce that?!)? What is a TAF? Why is Ada5 also called Spt20?
The nomenclature for S. cerevisiae is precise and helpful. Wild type genes are normally given an italicized, three letter acronym based on the phenotype of a mutation in that gene. So a HIS gene is unable to make histidine if the gene is defective (of course, dead cells all have only one phenotype so this presumes loss of the gene product doesn't kill the cell...). Since there exist several genes that can give rise to similar phenotypes, associated genes are given a number, e.g. HIS3, HIS4,, etc. To describe recessive mutant alleles, lower case letters are used. So a strain that is his3 has a mutation that affects the function of the HIS3 gene. Since there can be several different mutations described for any given gene, a second number gets associated with the mutant, e.g. his3-1 or his3delta200. Proteins are distinguished from DNA by capitalizing only the first letter of the gene product: the HIS3 gene makes the His3 protein. Naturally there are exceptions to these rules, but in general you can pretty confidently follow them.

Begin aquainting yourself with the SAGA subunits by copying the following tables to your wiki userpage, then finding relevant information in the Saccharomyces Genome Database for the subunits. You can also search Pubmed to find out a little more about the role of the subunits in gene expression. Do not shortchange yourself on this part of the experiment, since you will be working with the subunit you choose today for the rest of the module.

SAGA subunits, S. cerevisiae

Ada subunits size,chromosome,null p-type notes
Ada1 (aka HFI1, SUP110, SRM12, GAN1) 1.467 kb/489 aa, Chr. XVI,
viable
Ada2 (aka SWI8) 1.305 kb/434aa, Chr. IV,
viable
Ada3(aka NGG1, SWI7) 2.109 kb/702aa, Chr. IV,
viable
Gcn5 (aka ADA4, SWI9) 1.32 kb/439aa, Chr. VII,
viable
Ada5 (aka SPT20) 1.815 kb/604aa, Chr. XV,
viable
Spt subunits size, chromosome, null p-type notes
Spt3 1.014 kb/337aa, Chr. IV,
viable
Spt7(aka GIT2) 3.999 kb/1332aa, Chr. II,
viable
Spt8 1.809 kb/602aa, Chr. XII,
viable
Spt20 (aka Ada5) 1.815 kb/604aa, Chr. XV,
viable
TAF subunits size, chromosome, null p-type notes
TAF5 (aka TAF90) 2.397 kb/798aa, Chr. II, inviable
TAF6 (aka TAF60) 1.551 kb/516aa, Chr. VII, inviable
TAF9 (aka TAF17) 0.474 kb/157aa, Chr. XIII, inviable
TAF10 (aka TAF23, TAF25) 0.621 kb/206aa, Chr. IV, inviable
TAF12(aka TAF61, TAF68) 1.620 kb/539aa, Chr. IV, inviable
Tra1 subunit size, chromosome, null p-type notes
Tra1 11.235 kb/3744aa, Chr. VIII, inviable
other subunits size, chromosome, null p-type notes
Sgf73 1.974 kb/657aa, Chr. VII ,
viable
Sgf29 0.779 kb/259aa, Chr. III,
viable
Sgf11 0.3 kb/99aa, Chr.XVI,
viable
Ubp8 1.416 kb/471aa, Chr. XIII,
viable
Sus1 gene with intron, Chr. II,
viable

Part 2: Designing deletion oligos

Everyone's starting strain is called FY2068, which has the following genotype:


MAT(alpha) ura3-52 his3D200 leu2D1 lys2-128delta

This genotype tells you that the strain is haploid and of the "alpha" mating type. The genotype also tells you that the strain cannot make its own uracil, histidine, leucine or lysine due to the indicated mutations in the URA3, HIS3, LEU2 and LYS2 genes. These mutations have insignificant effects when the strain is grown on rich or "complete" media but no growth occurs when one of those needed media components is left out, or "dropped out" as such media is usually described.

Everyone will delete a SAGA subunit of their choosing by replacing it with a selectable marker, namely the URA3 gene. Successful replacement of the SAGA subunit with the URA3 gene will restore growth of the strain on "SC-ura," which is media lacking uracil ("SC" for "synthetic complete;" "minus ura" for the absence of uracil). The primers you design today will have sequences identical to the URA3 gene, shown below. The homology will enable the primers to anneal to the URA3 gene and amplify it during the polymerase chain reactions you will perform at the end of lab today. These reactions will be more fully explained later in the experimental module.

Everyone's primers will also include "tails" that will later allow the amplified URA3 gene to replace the SAGA subunit gene of interest. These tails must be at least 39 bases long to allow for sufficient specificity and frequency of recombination once transformed for recombination into the yeast cell. This allows 20 bases for annealing to the URA3 gene since most oligonucleotide synthesis companies change their pricing structure and recommendations for oligos longer than 59 bases. Limitations in the synthesis technology impose the 59 base limit, but it turns out to be minimally intrusive for experiments like these.

Designing the "forward" primer

  1. You and your partner should begin by opening a new MSWord document to create a primer record for the sequences you are designing today. Put your names at the top, your team color, today's date and a short description of what you are trying to do.
  2. Begin your primer design by retreiving the genomic DNA sequence for the URA3 gene as it's listed at SGD. You will need 20 bases from the start of the gene to serve as the "landing sequence" for this primer. Copy these 20 bases to your MSWord document.
  3. It is extremely important to know the melting temperature for this 20 base landing sequence. The first few rounds of PCR must be performed at a temperature that allows these 20 bases to bind. The "tails" of the primer will still be present during these first rounds of PCR but will have no complementary sequences to which they can anneal. The melting temperature depends on both the length of the landing sequence, its GC content, and any secondary structures that the primer can adopt. A well-designed primer will have short hairpins (if any), its melting temperature will be around 60°C, and if possible its GC content will be about 50%. There are several websites to help you evaluate these aspects of your primer. Try to copy the 20 bases of landing sequence into the Cybergene website. Note on your MSWord document the Tm,


Designing the "reverse primer

Part 3: PCR

  1. pRS406 template

DONE!

For next time

Reagents list