From OpenWetWare

< CH391L/S12
Revision as of 21:26, 26 February 2012 by James L. Bachman (Talk | contribs)
Jump to: navigation, search



The first step toward gene expression is the transcription of a DNA template into a complementary RNA strand. This process is done by RNA polymerase, which reads the DNA template and produces an antiparallel RNA copy. As in DNA replication, the complementary strand is produced 5'->3'If the DNA template encodes for a gene, this RNA transcript will be refined into mRNA, which is further translated into a functional protein. The RNA transcript may also go on to make ribosomal RNA (rRNA), transfer RNA (tRNA), or many other RNA products. The entire process can be broken into three major steps: initiation, elongation, and termination.


Initiation of transcription occurs differently in eukaryotes and prokaryotes. In eukaryotes, the transcription initiation complex must be formed. This includes, the core promoter, transcription factors, RNA polymerase, and activators/repressors. In E. coli, RNA polymerase and sigma factors are needed, as well, it may be necessary to have activators/repressors based on the promoter being used. For E. coli, the RNA polymerase will bind tightly to the promoter to form an open promoter complex, then must choose the transcription start site and escape from the promoter. It is necessary to balance the strength of promoter binding the ability to escape so elongation can happen. RNAP may undergo abortive initiation in which it will form many short 9-10 bp segments until it clears the promoter and begins elongation.


The consensus Sequence of E. coli shown in lavender box[1]
The consensus Sequence of E. coli shown in lavender box[1]

A DNA sequence that recruits transcriptional machinery and lead to transcription of downstream DNA. In E. coli the -10bp and -35bp are locations of the most well conserved DNA sequences in bacterial promoters. There are on average 17 bp between the two sequences and 7 bp between the -10bp location and the transcription start site. Consensus sequences are the nucleotide sequences that share a common function, which is binding to RNAP in the case of promoters. Promoters that most closely resemble the consensus sequence will be the strongest promoters, just as those that differ from the consensus sequence will be weaker promoters. Interestingly, there has not been a promoter found in E. coli that is of the consensus sequence, it would likely bind so strongly that elongation would not occur.


An unregulated promoter that allows for continual transcription of its associated genes. These promoters do not rely on input and depend only the level of free RNA polymerase holoenzyme are referred to as constitutive. Since the holoenzyme is needed, it can also be said that these rely on the level of sigma factors.

Positive, Negative, and Multi-regulated promoters

These promoters depend on the level of transcription factors that are not sigma factors. In positively regulated, as the concentration of activator increase, the rate of transcription also increases. If an activator protein relies on the binding of an exogenous molecule to activate it, then the promoter may be referred to as inducible. For negative promoters, increased levels of a repressor will lower the activity of these promoters. If a repressor that inactivates the promoter is always present and an exogenous molecule is added that binds the repressor and deactivates it, then promoter may be referred to as inducible. Multi-regulated promoters are either positively or negatively regulated by multiple transcription factors. These are most useful when a promoter that relies on multiple environmental factors to function is desired.

Prokaryotic Sigma Factors

The E.coli RNA polymerase consist s of 5 subunits:2α, β, β', ω. The sigma factor is the 6th subunit, it is needed in forming the RNAP holoenzyme complex which is necessary in promoter binding. The sigma factor helps to recognize the -10 and -35 bp segments of the promoter. The most common sigma factor used in E. coli is the σ70 subunit. This is the housekeeping sigma factor and is used during transcription of most genes. It recognizes the consensus sequence: TTGACA__(17)__TATAAT. [2]There are an additional 6 sigma factors, active in different situations. Such as σ32, which is the heat shock sigma factor. The B. subtilis housekeeping sigma factor is σA, similar to σ70 in E. coli.

Determining Strength

The strength of the different promoters is determined by the relative frequency of transcription initiation. This is mainly affected by the affinity of the promoter sequence for RNA polymerase. (cite)Quantitatively measuring this can be done by monitoring protein synthesis rate of a protein such as GFP. Promoters that differ significantly from the consensus sequence will be weaker than those that resemble the consensus sequence due to binding affinity.[3]

Promoter Examples

E. coli Promoter: LacUV5

The -10 and -35bp sequences of lacUV5 and lac compared to the E. coli consensus sequence[4]
The -10 and -35bp sequences of lacUV5 and lac compared to the E. coli consensus sequence[4]

LacUV5 is a constitutive promoter mutated from of the lac promoter found in E. coli. The lac promoter is considered weak, it varies from the consensus sequence by 3 bases. On the other hand, the lacUV5 mutated promoter varies from the consensus sequence by only 1 base and is much stronger than the lac promoter.

Strong Promoter: Bacteriophage T7 Promoter

The T7 promoter is derived from bacteriophage T7. The T7 RNA polymerase has a very high affinity for its own promoters which do not occur naturally in E. coli, it also very efficient resulting in elongation that is five-fold faster than E. coli RNAP. In the experiment done by moffatt et al. the gene transcribing T7 RNAP was introduced under the control of the lacUV5 promoter. They showed that the T7 RNAP will transcribe almost any gene connected to a T7 promoter introduced into the E. coli genome. It was found that even with a small amount of T7 RNAP, the mRNA transcripts were saturating the translational machinery of E. coli. A target protein could accumulate up to 50% of the total cellular protein in ~3 hours.[5]

Constitutive Bacterial Promoter Design

Saurer et al. generate various promoters spanning two orders in magnitude of strength. The promoters were based on the E. coli rrnB P1, a strong σ70-dependent promoter with near consensus −10 (TATAAT) and −35 (TTtACg) elements. The region from position -105 to +55 were insulated, blocking almost all transcription factor binding sites as well as defining a specific 5' mRNA start site. Using proD, their first generation insulated promoter, the production of the GFP reporter gene was driven. Measuring strength of this promoter relative to another, the GFP synthesis rate was monitored. They found that the insulated proD promoter performed significantly better than a minimal promoter. Through the use of degenerate oligonucleotides, they randomized the -35 and -10 bp sequences to make a promoter library. [6] The result is a library of promoters that are highly predictable and minimize effects of from the surrounding genome.

GPD Yeast Promoter

A constitutive yeast promoter is the GPD promoter. Found in the registry: The 2011 British Columbia iGEM team performed Fluorescent analysis of the GPD promoter based on a media change. As shown, there was no noticeable difference in GFP expression. [7] This same promoter is present in the addgene S. cerevisiae Advanced Gateway Destination Vectors This kit allows for expression of open reading frames through either GPD or GAL1.


The formation of the hairpin loop followed by the stretch of Us that forms both help to terminate transcription[8]
The formation of the hairpin loop followed by the stretch of Us that forms both help to terminate transcription[8]


Terminators consist of a G+C rich dyad symmetry sequence followed by a poly (T) tract. If the terminator is bidirectional, then a poly (A) segment will be upstream to the dyad symmetry sequence. The iGEM registry has forward, reverse, and bidirectional terminators. The most important piece of a terminator is the G+C dyad sequence that will form a a hairpin loop in the RNA transcript. It has also been shown that the poly (T) tract is needed for termination to occur.



In this type of termination, a protein factor called Rho destabilizes the DNA template-RNA transcript complex, causing the release of the RNA transcript. Rho-dependent terminators are not included in the iGEM registry because these terminators are not specified by sequence.


These terminators are composed A, T rich sequences as well as a two-fold symmetric DNA sequence rich in G+C. When transcribed by RNA, these sequences lead to the formation hairpin loop rich in G-C base pairs. The formation of the RNA G-C rich stem loop causes a pause in the RNA Polymerase. This pause, followed by the transcription of the poly A tail into a run of U's causes a mechanical stress and the unwinding of the RNA-DNA complex, causing the dissociation of the RNA transcript from RNA polymerase. Rho-independent terminators may have a stabilization effect on the gene they succeed.


In yeast, termination is different for each RNA polymerase (I-III). The process involves the polyadenylation at the 3' end of the RNA transcript. A set of proteins cleave off the RNA transcript and then synthesize the poly A tail, independent of the DNA template. This step is important toward refining the RNA into mRNA that will translated.

Terminator Examples

E. coli crp Terminators

The terminator of the E. coli crp gene that encodes for the cAMP receptor protein follows a rho-independent termination model. Aiba et al. showed that crp terminator assisted in stabilizing the RNA transcript of this gene. As well, it was shown that the G+C rich stem was the stabilizing factor rather than the poly A or T segments. By making variants of the crp terminator, it was shown that disruption of the G+C rich dyad almost completely eliminated terminator function. While disruption of the T tail significantly lowered terminator function, but not completely. This showed that the G+C rich dyad is the most important piece terminator with respect to function. [9]

iGEM Terminators

Terminator sequences can be used to terminator forward, reverse, or bidirectional transcription. An example of a forward terminator is:

#$% Yeast Terminator


Error fetching PMID 2409292:
Error fetching PMID 10713082:
Error fetching PMID 16285917:
Error fetching PMID 7568019:
Error fetching PMID 3537305:
Error fetching PMID 20843779:
Error fetching PMID 17313685:
Error fetching PMID 9150882:


  2. Error fetching PMID 16285917: [Weiss2005]


  4. Error fetching PMID 3537305: [Moffat1985]
  5. Error fetching PMID 20843779: [Saurer2010]


  7. Error fetching PMID 17313685: [kingsford2007]
  8. Error fetching PMID 9150882: [aiba1996]
  9. Error fetching PMID 2409292: [Carpousis1984]
  10. Error fetching PMID 10713082: [NoelRJ2000]
  11. Error fetching PMID 7568019: [Wilson1995]
All Medline abstracts: PubMed HubMed
Personal tools