CH391L/S13/DnaAssembly

From OpenWetWare

< CH391L/S13(Difference between revisions)
Jump to: navigation, search
(Polymerase Chain Reaction)
Current revision (02:06, 3 May 2013) (view source)
(Restriction Enzyme)
 
(194 intermediate revisions not shown.)
Line 1: Line 1:
[[Category:CH391L_S13]]
[[Category:CH391L_S13]]
-
==Introduction==
+
As described [[CH391L/S13/Introduction#What_is_synthetic_biology | before]], synthetic biology captures a diverse, multi-disciplinary field. No matter which definition(s) becomes accepted, the ability to make and manipulate DNA is a vital component to practicing synthetic biology.
-
A large number of parts have been made by the synthetic biology community.  Many can be found as part of the [http://partsregistry.org/Main_Page Registry of Standard Biological Parts]. These modular genetic components are designed to be easy to assemble to facilitate the building of more complex biological devices. To learn more about the Registry and BioBricks™, see [[CH391L/S12/iGEM Registry | last year's]] wiki page.
+
A large number of parts have been made by the synthetic biology community.  Many can be found as part of the [http://partsregistry.org/Main_Page Registry of Standard Biological Parts]. These modular genetic components are designed to be easy to acquire and assemble to facilitate the building of more complex biological devices. To learn more about the Registry and the biological parts known as BioBricks™, see the entry for the [[CH391L/S12/iGEM Registry | iGEM Registry]].
-
 
+
-
Synthetic biology relies on the ability to make testable biological units.  This means that the ability to gather, manipulate, or even create genetic material is vital for "doing" synthetic biology. The techniques employed by synthetic biologists include oligonucleotide synthesis, restriction enzyme digestion, ligase, pcr, and recombination (homology).
+
-
 
+
-
Trying to get from genes to genomes.
+
==DNA Synthesis==
==DNA Synthesis==
 +
===Oligonucleotide Synthesis===
===Oligonucleotide Synthesis===
 +
<!--Regardless of the length of the eventual product, synthetic DNA constructs are built from some combination of short DNA oligonucleotides. These oligonucleotides are later assembled into a complementary DNA duplex, amplified and inserted into their final genetic context.
-
[http://en.wikipedia.org/wiki/Oligonucleotide_synthesis from wikipedia] <br>
+
http://med.stanford.edu/sgtc/resources/images/Chemical_DNA_Synthesis_Chart_3-9-10_revised_PW.pdf -->
 +
 
 +
Oligonucleotides, short single-stranded DNA pieces, are [http://en.wikipedia.org/wiki/Oligonucleotide_synthesis chemically synthesized] from DNA phosphoramidite monomers. Briefly, activated phosphoramidite monomers are added in the 3' to 5' direction using a cycle of activation and blocking chemistry to obtain a DNA polymer linked by phosphodiester bonds.
 +
 
 +
[[Image:CH391L_S12_Phosphoramidite.png]]
 +
 
 +
Chemical synthesis is currently limited to oligonucleotides of about 200 nt in length. It is used often by synthetic biologists to make primers for PCR or gibson assembly (described later).
===Gene Synthesis===
===Gene Synthesis===
-
IDT Gene Blocks <br>
+
Gene synthesis, or [http://en.wikipedia.org/wiki/Artificial_gene_synthesis artificial gene synthesis], refers to the process of creating a nucleic acid template for a gene ''in vitro'', without the requirement of a preexisting DNA template. It predates the invention of restriction enzymes and molecular cloning techniques by several years. The first gene to be completely synthesized ''in vitro'' was a 77-nt alanine transfer RNA by the laboratory of [http://en.wikipedia.org/wiki/Har_Gobind_Khorana Har Gobind Khorana] in 1972 <cite>Khorana1972</cite>. This was the result of nearly five years of work and resulted in a DNA template without promoter or transcriptional control sequences. The first peptide- and protein-producing synthetic genes were created in 1977 and 1979, respectively <cite>Itakura1977, Goeddell1979</cite>. Chemical synthesis of DNA has grown from an expensive and time-consuming process into a viable commercial industry capable of versatile and high-throughput manufacturing. DNA can now be made to order for species-specific gene optimization, creation of genes from rare or dangerous sources, and combinatorial assembly of any DNA sequence that can be chemically synthesized, including non-natural bases.
-
DNA2.0 <br>
+
 
-
invitrogen (life technologies, geneart) <br>
+
[[Image:Carlson_longest_sDNA_2010-thumb-500x457.png|thumb|center|500px|Longest Published Synthetic DNA [http://www.synthesis.cc/2009/06/data-and-references-for-longest-published-sdna.html]]]
-
genscript <br>
+
 
-
celtek <br>
+
Despite nearly four decades of progress in gene synthesis technologies, most DNA sequences used in modern molecular biology are assembled in part or in whole from naturally occurring templates. However this limits the scope and applications to previously existing genes and the results of large-scale genomic surveys of novel genes from nature. Modern gene synthesis relies heavily on advancements in chemical DNA oligonucleotide synthesis, with the primary challenges being scale, cost, fidelity and the eventual assembly of complete gene products.
-
biobasic <br>
+
 
-
sigma? <br>
+
An extensive, but not comprehensive, directory of commercial gene synthesis providers can be found at [http://www.genespace.net/ Genespace].
-
GENEWIZ <br>
+
 
-
biomatik <br>
+
=====Economics=====
-
OriGene (blue heron) <br>
+
 
-
Epoch Life Science <br>
+
<!--Currently, synthetic DNA oligonucleotides less than 60 Bases can be obtained for about #0.35 USD/Base [http://www.idtdna.com/pages/products/dna-rna/custom-dna-oligos]-->
 +
 
 +
Gene synthesis is one of the key enabling technologies of synthetic biology. The increase in efficiency and the decrease in cost is occurring at a staggering rate. Below is a graph comparing DNA synthesis to Moore's Law. In addition, it includes another enabling technology, DNA sequencing. The bottom right graph shows that genome synthesis is technically feasible <cite>Gibson2009</cite>; however, it is not yet a commercial service.
 +
 
 +
[[Image:Carlson_cost_per_base_june_2011.png|thumb|left|375px|Cost per Base of DNA Sequencing and Synthesis.[http://www.synthesis.cc]]]
 +
[[Image:Cost_per_genome_20130122.jpg|thumb|center|425px| Cost per Genome.[http://www.genome.gov/sequencingcosts/]]]
-
References <br>
+
<!-- [http://www.synthesis.cc http://www.synthesis.cc/assets_c/2010/05/carlson_longest_sDNA_2010-thumb-500x457.png]-->
-
Cost <br>
+
<!--[http://www.synthesis.cc http://www.synthesis.cc/carlson_cost%20per_base_june_2011.png]-->
-
Time <br>
+
<!--[http://www.genome.gov/sequencingcosts/ http://www.genome.gov/images/content/cost_per_genome.jpg]-->
==Molecular Cloning==
==Molecular Cloning==
 +
 +
It is impractical for most synthetic biologists to synthesize more than several kilobases of completely synthetic DNA. It is often desirable to build bigger pieces at lower costs and faster speeds than de novo synthesis is currently able to accomplish.
 +
 +
Over the years, many different strategies have been developed to assemble DNA in flexible ways that suit different purposes.  These strategies typically employ purification of enzymes that are known to modify DNA in specific ways and these methods of action can be exploited for designing and building specific sequences of DNA. For example, restriction enzymes are used to cut DNA in a specific manner upon recognition of a specific nucleotide sequence. Polymerases and endonucleases add or remove nucleotides to make double stranded DNA from a single stranded template or to create single stranded DNA from double stranded DNA.  Many of the modern techniques take advantage of recombination machinery that move DNA from one location to another location. In other cases, the endogenous enzymes in a host are utilized to manipulate DNA without the need for prior purification.  For example, in some methods endogenous DNA ligase is used to repair single stranded breaks (known as "nicks") to complete the formation of fully circular DNA.
 +
 +
Ultimately, these methods generally require transformation into a host where endogenous enzymes are used to complete the genetic manipulation and replicate (clone) the genetic material. This allows for the expression of the desired proteins to test the ability of the engineered system or for the purification of the genetic material itself such that it can be used for further manipulation, study, or storage.
 +
 +
While there are many specific protocols for the numerous methods of cloning, most share reasonable overlap in their underlying mechanisms of action. Broadly speaking, methods may rely primarily on restriction enzymes, polymerase chain reaction (PCR), or on homologous recombination <cite>Hughes2011, Czar2009</cite>.
===Restriction Enzyme===
===Restriction Enzyme===
-
BioBricks <br>
+
 
-
BglBricks <br>
+
Restriction enzymes recognize a specific nucleotide sequence and then cut the DNA in such a way that results in a double stranded break. If the enzyme cuts within or near the recognition site, it is classified as a Type II restriction enzyme. When a restriction enzyme cuts a piece of DNA, the ends can be straight resulting in a blunt end or with a jagged cut resulting in a sticky end.
-
[[The_BioBricks_Foundation:Standards/Technical/Formats | list]] of BioBrick Foundation Standards <br>
+
 
 +
For example:
 +
EcoRI digestion produces "sticky" ends, <br>
 +
 
 +
[[Image:EcoRI.png|thumb|center|300px| "sticky" end cutter: EcoRI]] <br>
 +
 
 +
whereas AluI restriction enzyme cleavage produces "blunt" ends: <br>
 +
 
 +
[[Image:AluI.png|thumb|center|300px|"blunt" end cutter: AluI]] <br>
 +
 
 +
The Registry of Biological Parts has developed [[The_BioBricks_Foundation:Standards/Technical/Formats | standards]] for the type and position of restriction enzyme sites to be used when building DNA using restriction enzymes in order to ensure compatibility in the connecting of biological parts. Two of the more popular standards are named and listed below:
 +
 
 +
*BioBricks <br>
 +
*BglBricks <br>
 +
 
 +
One modification to the standard restriction enzyme method is the use of an enzyme that recognizes a non-palindromic sequence. This results in the ability to cut with a single restriction enzyme and still maintain directionality of the biological part.
 +
 
 +
*CpoI directional cloning <br>
 +
 
 +
Other methods have been developed that utilize Type IIs restriction enzymes. These enzymes cut away from their recognition site. This strategy has the advantage over Type II enzymes in their reduction of a "scar" sequence and ability to generate combinatorial libraries.
 +
 
 +
*golden gate  <cite>Engler2008</cite> <br>
 +
*Modular Cloning (MoClo)  <cite>Weber2011</cite> <cite>Werner2012</cite> <br>
 +
*GoldenBraid  <cite>SarrionPerdigones2011</cite><br>
===Polymerase Chain Reaction===
===Polymerase Chain Reaction===
-
TOPO TA cloning (invitrogen) <br>
 
-
SOE (splice by overlap extension) pcr <br>
 
-
===Ligase===
+
Polymerase Chain Reaction (PCR) is a fast and reliable way of making multiple copies of specific sequences of DNA.  The PCR method typically allows for relatively simple and fast construction of genes by linking multiple shorter oligonucleotides together in a series of steps where the earlier PCR product acts as a primer that results in a longer DNA fragment that itself can act as a primer for subsequent rounds of PCR.  An exception is TOPO TA Cloning (Invitrogen) <cite>Holton1991</cite><cite>Shuman1991</cite><br>, which utilizes the propert of Taq polymerase to leave a single A nucleotide overhang after the PCR reaction.  In combination with an optimized vector with a T nucleotide overhang and a topoisomerase that recognizes the vector at the T overhang position, the insert and vector that are annealed at the T and A nucleotides are sealed into a circular vector efficiently.
 +
 
 +
Oligonucleotide precursors are linked together leaving gaps that must be filled in by DNA polymerase. In the most common method, polymerase cycling assembly (PCA) <cite>Stemmer1995</cite>, DNA oligonucleotides are designed to be part of either the top or bottom of the final DNA duplex. Cycles of annealing and polymerase extension result in a growing DNA duplex built from smaller oligonucleotide fragments. A final round of PCR is done with constant primers that amplify the complete desired gene product. Initial oligo linkage is done for 20-30 cycles and full-length PCR is carried out for 20-30 additional cycles.
 +
 
 +
Oligos must be designed carefully to have similar melting temperatures and be free of interfering secondary structures (high GC content, hairpins, etc.)
 +
 
 +
[[Image:CH391L_S12_PCA.jpg|thumb|center|500px|A schematic diagram of PCA]]
 +
 
 +
Additional methods of gene assembly by PCR are listed below:
 +
*Ligation Independent Cloning (LIC) <cite>Aslanidis1990</cite><cite>Aslanidis1994</cite><cite>Li1997</cite> <br>
 +
*Uracil-Specific Excision Reagent (USER) (NEB) <cite>GeuFlores2007</cite><br>
 +
*Splice by Overlap Extension (SOE) PCR <cite>Higuchi1988</cite><cite>Horton2009</cite><br>
 +
 
===Recombination/Homology===
===Recombination/Homology===
-
[http://j5.jbei.org j5] from [[User:Nathan_J._Hillson | Nathan Hillson]] <br>
 
-
-In-Fusion (Clontech) poxvirus DNA polymerase with 3′–5′ exonuclease activity <br>
+
The following methods rely on recombination of homologous DNA for gene assembly:
-
-In-Fusion BioBrick Assembly <br>
+
-
-cold fusion (SBI) <br>
+
-
-lambda red <br>
+
*SLIC sequence and ligation independent cloning T4 DNA polymerase (exonuclease) <cite>Li2007,Li2012</cite><br>
-
-golden gate <br>
+
-
-lic <br>
+
-
-gateway <br>
+
-
-SLIC sequence and ligation independent cloning T4 DNA polymerase (exonuclease) <br>
+
[[Image:Infusion.gif|thumb| right|300px| [http://openwetware.org/wiki/In-fusion_biobrick_assembly In-Fusion Cloning Diagram]]] <br>
-
-[http://openwetware.org/wiki/Gibson_Assembly Gibson] T5 exonuclease, Phusion polymerase, Taq ligase <br>
+
-
-CPEC <br>
+
-
-SLiCE (Seamless Ligation Cloning Extract) in vitro homologous recombination <br>
+
*Variations of the SLIC procedure:
 +
**In-Fusion (Clontech) poxvirus DNA polymerase with 3′–5′ exonuclease activity <cite>Zhu2007</cite><cite>Benoit2006</cite><br>
 +
**In-Fusion BioBrick Assembly <cite>Sleight2010</cite> <br>
 +
**Cold fusion (SBI) <br>
 +
**CloneEZ kit (Genescript) [http://www.genscript.com/tech_guide/TM0279.pdf], recombination around a linearized vector<br>
 +
**GENEART Seamless Cloning (Life Technologies previously Invitrogen previously DoGene) <br>
-
-Fast Seamless Cloning (Dogene)
+
*Cre/Lox P1 phage (BD Creator from Clontech) <br>
-
-CloneEZ kit (Genescript)
+
*att lambda (Gateway Cloning from Life Technologies) <cite>Hartley2000</cite><cite>Khalil2007</cite><br>
-
-GENEART
+
-
===Transformation===
+
*CPEC circular polymerase extension cloning <cite>Quan2009</cite><br>
-
*e coli <br>
+
*SLiCE (Seamless Ligation Cloning Extract) in vitro homologous recombination <cite>Zhang2012</cite> <br>
-
*s. cerevisiae <br>
+
-
===Bacterial Mating===
+
====Gibson Assembly====
 +
In 2009, Daniel Gibson of J. Craig Venter's group reported a method for assembling hundreds of kilobases of DNA sequence using an isothermal enzymatic setup known today as [http://openwetware.org/wiki/Gibson_Assembly "Gibson Assembly"] <cite>Gibson2008</cite><cite>Gibson2009</cite><cite>Gibson2009b</cite><cite>Gibson2011</cite>. Sub-fragments of the eventual assembly product are obtained as blunt-ended double-stranded DNA molecules several kilobases in length. They share several hundred bases of homology at their adjacent termini. T5 DNA Exonuclease chews back the double-stranded DNA molecule to reveal 3' single-stranded overhangs. Being heat-liable, T5 exonuclease is inactivated after several minutes at the reaction temperature. A mixture of thermostable ligase (Taq) and DNA polymerase (commonly Phusion) repair and assemble the individual fragments into larger molecules. This method has been used to assemble nonfunctional DNA molecules up to 900 kilobases in length, as well as the complete 16.3 kilobase mouse mitochondrial genome <cite>Gibson2010b</cite>. The size limit of molecules that can be assembled in this manner is currently limited to the cost of design and screening the final assembly products and the hundreds of kilobases of DNA that must be sequenced. An illustrative [https://www.neb.com/applications/cloning-and-mapping/gibson-assembly video] of this process can be found on the NEB website. Of all the cloning techniques listed on this page, "Gibson" cloning is becoming one of the most dominant cloning strategies in synthetic biology labs. Its fast, flexible, relatively simple design has made it a favorite among synthetic biologists.
-
-MoClo <cite>Weber2011</cite>
+
[[Image:Gibson.png|thumb|center|500px| Schematic Drawing of the "Gibson" cloning method. Note that this can be done on multiple DNA fragments and in "one pot" at a constant temperature (50° C). ]]
-
-mage <br>
+
 
 +
===In Vivo===
 +
There is a category of gene assembly tools that operate in live cells:
 +
 
 +
*Mating-assisted genetically integrated circuit (MAGIC)-bacterial mating <cite>Li2005</cite><br>
 +
*ET recombination <cite>Zhang1998</cite> <br>
 +
*Transformation-associated recombination <cite>Larionov1996</cite> <br>
 +
 
 +
==Moving Forward: Scaling Up Gene Synthesis==
 +
 
 +
The methods below are not new methods and some are already being used commercially. What they offer for the future are methods for potentially scaling up production while decreasing costs.  As mentioned before, cost and speed are major limitations preventing de novo DNA synthesis from becoming standard practice.
 +
 
 +
====Solid-Phase Synthesis====
 +
An alternative gene synthesis technique using solid-phase synthesis of a DNA duplex has been created by several companies, including [http://www.blueheronbio.com/ Blue Heron]. In these methods, DNA oligonucleotides are synthesized as in other techniques, they are annealed into complementary short duplexes without overhanging ends, and these duplexes are chemically ligated on a solid phase such as a liquid chromatography column. The complete synthetic DNA duplex is then eluted and used for downstream processes, such as cloning or gene expression.
 +
 
 +
====Multiplex/Microchip Synthesis====
 +
DNA oligonucleotides have become a commodity and the price per base has been steadily dropping. But there is still great expense involved in large-scale gene synthesis. For a several kilobase gene fragment, oligo cost alone can still exceed $1,000. To over come this hurdle, multiplex parallel gene synthesis technologies are under development, including inkjet DNA printers, photolithography and electrochemical parallel synthesis.
 +
 
 +
Microfluidic microchips are a common multiplexing solution. In the method of Tian et al. <cite>Tian2004</cite>, oligonucleotides are synthesized while coupled to a microchip at their 3' end. The overall process is similar to traditional oligo synthesis, except that individual wells are photoactivated before addition of the next base onto the 5' end. After chemical cleavage of the complete oligo from the microchip, they are hybridized and purified using complementary "quality assurance" oligos bound to magnetic beads. This method eliminates oligos that have either incorporated extra bases or undergone deletions. Final yield from each synthesis well is only around 5 fmol of oligo, which is not sufficient for downstream applications. A final amplification step using common PCR primers to all oligos increases yields up to a million-fold. Oligos can then be assembled by overlap PCR-based methods such as polymerase assembly multiplexing, or PAM. This method has resulted in the synthesis, amplification and expression of a 14.5 kb, 21-gene 30S ribosomal protein operon from ''E. coli''.
 +
 
 +
====Minimizing Errors====
 +
One major roadblock remaining in synthetic gene synthesis is overcoming natural error rates in DNA synthesis chemistry. Phosphoramidite oligo synthesis currently has an error rate of 1 in ~160. This results in a high enough error rate in assembled genes that screening and sequencing are major cost bottlenecks, as is the repair or re-synthesis of mutated genes. While the development of cheap and high-throughput sequencing technologies will streamline the screening process, oligo synthesis error rates must also be reduced.
 +
 
 +
Currently, oligo pools are often screened against complementary reference libraries that can be reused. Oligos that do not completely base-pair with their designed sequence are either washed away in purification steps or are cleaved by DNA mismatch repair proteins due to their imperfect base-pairing. This has currently decreased error rates to 1 in 10<sup>6</sup>, on the order of error rates in DNA polymerases.
 +
 
 +
===Genome Synthesis===
 +
J. Craig Venter's team at JCVI carried out the largest gene synthesis currently reported through the creation of a completely synthetic genome. Initially, the JCVI team synthesized a 582,970-base pair "minimized" genome of ''Mycoplasma genitalium'' using a stepwise assembly <cite>Gibson2008</cite>. The genome was divided into 24 cassettes of about 24 kb each which were synthesized by commercial manufacturers. These overlapping cassettes were enzymatically stitched together into bacterial artificial chromosomes, which were attached in a stepwise fashion using a combination of restriction digestion and ''in vitro'' recombination. At the point where the molecules were "quarter genomes", they were transformed into yeast where ''in vivo'' recombination machinery assembled the full ''M. genitalium'' chromosome. This was only proof of synthesis, however, and not of a synthetic genome useful for supporting a living organism.
 +
 
 +
In 2010, the JCVI team reported the creation of a bacterium controlled by a fully synthetic genome <cite>Gibson2010</cite>. The 1.08 mega-base pair genome of ''Mycoplasma mycoides'' was synthesized and transplanted into a ''M. capricolum'' recipient cell to create a synthetic bacterium nicknamed "Synthia". The genome was divided into 1,078 1-kb cassettes, which were synthesized by Blue Heron. These cassettes were assembled in yeast using ''in vivo'' homologous recombination using a three-step hierarchy that built fragments of ~10,000 bp, ~100,000 bp and finally the complete 1.08 mega-bp genome. These were then isolated carefully from contaminating yeast DNA and transplanted into the recipient bacteria. A selectable marker placed in the synthetic genome demonstrated that bacteria that grew after transplantation were controlled by the synthetic genome. The same team demonstrated that a semi-synthetic genome could be assembled and transplanted using a combination of natural and synthetic DNA.
 +
 
 +
Final assembly of large double-stranded DNA products into kilobase and megabase functional molecules, free of errors and with minimal cost remains a process bottleneck for gene and genome synthesis. ''In vitro'' recombination methods have been successfully used to assemble large DNA molecules on the order of tens of kilobases, but this method remains prone to errors and requires considerable homology be engineered into assembly fragments.
 +
 
 +
==See Also==
 +
 
 +
* [http://django.gibthon.org/ Gibthon - Gibson assembly design program]
 +
*[http://j5.jbei.org j5]: A tool for designing DNA assembly with recombination from [[User:Nathan_J._Hillson | Nathan Hillson]] The manual has an excellent overview of recombination-based cloning strategies like SLIC and Gibson. <cite>Hillson2012</cite><br>
 +
*[http://dnasu.asu.edu/DNASU/CloningStrategies.jsp Review] of Cloning techniques from ASU <br>
== References ==  
== References ==  
<biblio>
<biblio>
 +
#Khorana1972 pmid=4571075
 +
#Itakura1977 pmid=412251
 +
#Goeddell1979 pmid=85300
 +
#Tian2004 pmid=15616567
 +
#Gibson2008 pmid=18218864
 +
//Complete genome synthesis
 +
#Gibson2009 pmid=19363495
 +
//oligonucleotide assembly in vitro
 +
#Gibson2009b pmid=19745056
 +
//oligonucleotide assembly in yeast
 +
#Gibson2010 pmid=20488990
 +
//genome replacement
 +
#Gibson2010b pmid=20935651
 +
#Gibson2011 pmid=21601685
 +
//MIE paper
 +
#Dymond2011 pmid=21918511
#Hughes2011 pmid=21601682
#Hughes2011 pmid=21601682
//Gene Synthesis Review
//Gene Synthesis Review
Line 91: Line 189:
#Quan2009 pmid=19649325
#Quan2009 pmid=19649325
//CPEC
//CPEC
-
#Gibson2008 pmid=18218864
 
-
//
 
-
#Gibson2009 pmid=19363495
 
//T5 exonuclease recombination
//T5 exonuclease recombination
#Li2007 pmid=17293868
#Li2007 pmid=17293868
Line 107: Line 202:
#GeuFlores2007 pmid=17389646
#GeuFlores2007 pmid=17389646
//USER  
//USER  
-
#Gibson2009-2 pmid=19745056
 
-
//oligonucleotide assembly
 
#Horton2009 pmid=2357375
#Horton2009 pmid=2357375
//SOEing
//SOEing
Line 121: Line 214:
#Li1997 pmid=9321675
#Li1997 pmid=9321675
//LIC
//LIC
-
#Angrand1999 pmid=10446259
+
#Zhang1998 pmid=9771703
-
//lambda Red recombinase
+
//ET recombination
#Hartley2000 pmid=11076863
#Hartley2000 pmid=11076863
//Gateway lambda Int
//Gateway lambda Int
Line 130: Line 223:
//Transformation-associated recombination (TAR) cloning
//Transformation-associated recombination (TAR) cloning
#Hillson2012 j5 DNA Assembly Design Automation Software [[doi: 10.1021/sb2000116]]
#Hillson2012 j5 DNA Assembly Design Automation Software [[doi: 10.1021/sb2000116]]
-
#Gibson2010 pmid=20488990
 
-
//genome replacement
 
#Li2005 pmid=15731760
#Li2005 pmid=15731760
//MAGIC, bacterial mating approach
//MAGIC, bacterial mating approach
-
</biblio>
 
#Zhang2012 pmid=22241772
#Zhang2012 pmid=22241772
//SLiCe
//SLiCe
 +
#Higuchi1988 pmid=3045756
 +
//SOE-PCR
 +
#Holton1991 pmid=2020554
 +
//TA cloning
 +
#Shuman1991 pmid=1658796
 +
//TOPO
 +
</biblio>

Current revision


As described before, synthetic biology captures a diverse, multi-disciplinary field. No matter which definition(s) becomes accepted, the ability to make and manipulate DNA is a vital component to practicing synthetic biology.

A large number of parts have been made by the synthetic biology community. Many can be found as part of the Registry of Standard Biological Parts. These modular genetic components are designed to be easy to acquire and assemble to facilitate the building of more complex biological devices. To learn more about the Registry and the biological parts known as BioBricks™, see the entry for the iGEM Registry.

Contents

DNA Synthesis

Oligonucleotide Synthesis

Oligonucleotides, short single-stranded DNA pieces, are chemically synthesized from DNA phosphoramidite monomers. Briefly, activated phosphoramidite monomers are added in the 3' to 5' direction using a cycle of activation and blocking chemistry to obtain a DNA polymer linked by phosphodiester bonds.

Image:CH391L_S12_Phosphoramidite.png

Chemical synthesis is currently limited to oligonucleotides of about 200 nt in length. It is used often by synthetic biologists to make primers for PCR or gibson assembly (described later).

Gene Synthesis

Gene synthesis, or artificial gene synthesis, refers to the process of creating a nucleic acid template for a gene in vitro, without the requirement of a preexisting DNA template. It predates the invention of restriction enzymes and molecular cloning techniques by several years. The first gene to be completely synthesized in vitro was a 77-nt alanine transfer RNA by the laboratory of Har Gobind Khorana in 1972 [1]. This was the result of nearly five years of work and resulted in a DNA template without promoter or transcriptional control sequences. The first peptide- and protein-producing synthetic genes were created in 1977 and 1979, respectively [2, 3]. Chemical synthesis of DNA has grown from an expensive and time-consuming process into a viable commercial industry capable of versatile and high-throughput manufacturing. DNA can now be made to order for species-specific gene optimization, creation of genes from rare or dangerous sources, and combinatorial assembly of any DNA sequence that can be chemically synthesized, including non-natural bases.

Longest Published Synthetic DNA [1]
Longest Published Synthetic DNA [1]

Despite nearly four decades of progress in gene synthesis technologies, most DNA sequences used in modern molecular biology are assembled in part or in whole from naturally occurring templates. However this limits the scope and applications to previously existing genes and the results of large-scale genomic surveys of novel genes from nature. Modern gene synthesis relies heavily on advancements in chemical DNA oligonucleotide synthesis, with the primary challenges being scale, cost, fidelity and the eventual assembly of complete gene products.

An extensive, but not comprehensive, directory of commercial gene synthesis providers can be found at Genespace.

Economics

Gene synthesis is one of the key enabling technologies of synthetic biology. The increase in efficiency and the decrease in cost is occurring at a staggering rate. Below is a graph comparing DNA synthesis to Moore's Law. In addition, it includes another enabling technology, DNA sequencing. The bottom right graph shows that genome synthesis is technically feasible [4]; however, it is not yet a commercial service.

Cost per Base of DNA Sequencing and Synthesis.[2]
Cost per Base of DNA Sequencing and Synthesis.[2]
Cost per Genome.[3]
Cost per Genome.[3]


Molecular Cloning

It is impractical for most synthetic biologists to synthesize more than several kilobases of completely synthetic DNA. It is often desirable to build bigger pieces at lower costs and faster speeds than de novo synthesis is currently able to accomplish.

Over the years, many different strategies have been developed to assemble DNA in flexible ways that suit different purposes. These strategies typically employ purification of enzymes that are known to modify DNA in specific ways and these methods of action can be exploited for designing and building specific sequences of DNA. For example, restriction enzymes are used to cut DNA in a specific manner upon recognition of a specific nucleotide sequence. Polymerases and endonucleases add or remove nucleotides to make double stranded DNA from a single stranded template or to create single stranded DNA from double stranded DNA. Many of the modern techniques take advantage of recombination machinery that move DNA from one location to another location. In other cases, the endogenous enzymes in a host are utilized to manipulate DNA without the need for prior purification. For example, in some methods endogenous DNA ligase is used to repair single stranded breaks (known as "nicks") to complete the formation of fully circular DNA.

Ultimately, these methods generally require transformation into a host where endogenous enzymes are used to complete the genetic manipulation and replicate (clone) the genetic material. This allows for the expression of the desired proteins to test the ability of the engineered system or for the purification of the genetic material itself such that it can be used for further manipulation, study, or storage.

While there are many specific protocols for the numerous methods of cloning, most share reasonable overlap in their underlying mechanisms of action. Broadly speaking, methods may rely primarily on restriction enzymes, polymerase chain reaction (PCR), or on homologous recombination [5, 6].

Restriction Enzyme

Restriction enzymes recognize a specific nucleotide sequence and then cut the DNA in such a way that results in a double stranded break. If the enzyme cuts within or near the recognition site, it is classified as a Type II restriction enzyme. When a restriction enzyme cuts a piece of DNA, the ends can be straight resulting in a blunt end or with a jagged cut resulting in a sticky end.

For example: EcoRI digestion produces "sticky" ends,

"sticky" end cutter: EcoRI
"sticky" end cutter: EcoRI

whereas AluI restriction enzyme cleavage produces "blunt" ends:

"blunt" end cutter: AluI
"blunt" end cutter: AluI

The Registry of Biological Parts has developed standards for the type and position of restriction enzyme sites to be used when building DNA using restriction enzymes in order to ensure compatibility in the connecting of biological parts. Two of the more popular standards are named and listed below:

  • BioBricks
  • BglBricks

One modification to the standard restriction enzyme method is the use of an enzyme that recognizes a non-palindromic sequence. This results in the ability to cut with a single restriction enzyme and still maintain directionality of the biological part.

  • CpoI directional cloning

Other methods have been developed that utilize Type IIs restriction enzymes. These enzymes cut away from their recognition site. This strategy has the advantage over Type II enzymes in their reduction of a "scar" sequence and ability to generate combinatorial libraries.

  • golden gate [7]
  • Modular Cloning (MoClo) [8] [9]
  • GoldenBraid [10]

Polymerase Chain Reaction

Polymerase Chain Reaction (PCR) is a fast and reliable way of making multiple copies of specific sequences of DNA. The PCR method typically allows for relatively simple and fast construction of genes by linking multiple shorter oligonucleotides together in a series of steps where the earlier PCR product acts as a primer that results in a longer DNA fragment that itself can act as a primer for subsequent rounds of PCR. An exception is TOPO TA Cloning (Invitrogen) [11][12]
, which utilizes the propert of Taq polymerase to leave a single A nucleotide overhang after the PCR reaction. In combination with an optimized vector with a T nucleotide overhang and a topoisomerase that recognizes the vector at the T overhang position, the insert and vector that are annealed at the T and A nucleotides are sealed into a circular vector efficiently.

Oligonucleotide precursors are linked together leaving gaps that must be filled in by DNA polymerase. In the most common method, polymerase cycling assembly (PCA) [13], DNA oligonucleotides are designed to be part of either the top or bottom of the final DNA duplex. Cycles of annealing and polymerase extension result in a growing DNA duplex built from smaller oligonucleotide fragments. A final round of PCR is done with constant primers that amplify the complete desired gene product. Initial oligo linkage is done for 20-30 cycles and full-length PCR is carried out for 20-30 additional cycles.

Oligos must be designed carefully to have similar melting temperatures and be free of interfering secondary structures (high GC content, hairpins, etc.)

A schematic diagram of PCA
A schematic diagram of PCA

Additional methods of gene assembly by PCR are listed below:

  • Ligation Independent Cloning (LIC) [14][15][16]
  • Uracil-Specific Excision Reagent (USER) (NEB) [17]
  • Splice by Overlap Extension (SOE) PCR [18][19]

Recombination/Homology

The following methods rely on recombination of homologous DNA for gene assembly:

  • SLIC sequence and ligation independent cloning T4 DNA polymerase (exonuclease) [20, 21]

  • Variations of the SLIC procedure:
    • In-Fusion (Clontech) poxvirus DNA polymerase with 3′–5′ exonuclease activity [22][23]
    • In-Fusion BioBrick Assembly [24]
    • Cold fusion (SBI)
    • CloneEZ kit (Genescript) [4], recombination around a linearized vector
    • GENEART Seamless Cloning (Life Technologies previously Invitrogen previously DoGene)
  • Cre/Lox P1 phage (BD Creator from Clontech)
  • att lambda (Gateway Cloning from Life Technologies) [25][26]
  • CPEC circular polymerase extension cloning [27]
  • SLiCE (Seamless Ligation Cloning Extract) in vitro homologous recombination [28]

Gibson Assembly

In 2009, Daniel Gibson of J. Craig Venter's group reported a method for assembling hundreds of kilobases of DNA sequence using an isothermal enzymatic setup known today as "Gibson Assembly" [29][4][30][31]. Sub-fragments of the eventual assembly product are obtained as blunt-ended double-stranded DNA molecules several kilobases in length. They share several hundred bases of homology at their adjacent termini. T5 DNA Exonuclease chews back the double-stranded DNA molecule to reveal 3' single-stranded overhangs. Being heat-liable, T5 exonuclease is inactivated after several minutes at the reaction temperature. A mixture of thermostable ligase (Taq) and DNA polymerase (commonly Phusion) repair and assemble the individual fragments into larger molecules. This method has been used to assemble nonfunctional DNA molecules up to 900 kilobases in length, as well as the complete 16.3 kilobase mouse mitochondrial genome [32]. The size limit of molecules that can be assembled in this manner is currently limited to the cost of design and screening the final assembly products and the hundreds of kilobases of DNA that must be sequenced. An illustrative video of this process can be found on the NEB website. Of all the cloning techniques listed on this page, "Gibson" cloning is becoming one of the most dominant cloning strategies in synthetic biology labs. Its fast, flexible, relatively simple design has made it a favorite among synthetic biologists.

Schematic Drawing of the "Gibson" cloning method. Note that this can be done on multiple DNA fragments and in "one pot" at a constant temperature (50° C).
Schematic Drawing of the "Gibson" cloning method. Note that this can be done on multiple DNA fragments and in "one pot" at a constant temperature (50° C).

In Vivo

There is a category of gene assembly tools that operate in live cells:

  • Mating-assisted genetically integrated circuit (MAGIC)-bacterial mating [33]
  • ET recombination [34]
  • Transformation-associated recombination [35]

Moving Forward: Scaling Up Gene Synthesis

The methods below are not new methods and some are already being used commercially. What they offer for the future are methods for potentially scaling up production while decreasing costs. As mentioned before, cost and speed are major limitations preventing de novo DNA synthesis from becoming standard practice.

Solid-Phase Synthesis

An alternative gene synthesis technique using solid-phase synthesis of a DNA duplex has been created by several companies, including Blue Heron. In these methods, DNA oligonucleotides are synthesized as in other techniques, they are annealed into complementary short duplexes without overhanging ends, and these duplexes are chemically ligated on a solid phase such as a liquid chromatography column. The complete synthetic DNA duplex is then eluted and used for downstream processes, such as cloning or gene expression.

Multiplex/Microchip Synthesis

DNA oligonucleotides have become a commodity and the price per base has been steadily dropping. But there is still great expense involved in large-scale gene synthesis. For a several kilobase gene fragment, oligo cost alone can still exceed $1,000. To over come this hurdle, multiplex parallel gene synthesis technologies are under development, including inkjet DNA printers, photolithography and electrochemical parallel synthesis.

Microfluidic microchips are a common multiplexing solution. In the method of Tian et al. [36], oligonucleotides are synthesized while coupled to a microchip at their 3' end. The overall process is similar to traditional oligo synthesis, except that individual wells are photoactivated before addition of the next base onto the 5' end. After chemical cleavage of the complete oligo from the microchip, they are hybridized and purified using complementary "quality assurance" oligos bound to magnetic beads. This method eliminates oligos that have either incorporated extra bases or undergone deletions. Final yield from each synthesis well is only around 5 fmol of oligo, which is not sufficient for downstream applications. A final amplification step using common PCR primers to all oligos increases yields up to a million-fold. Oligos can then be assembled by overlap PCR-based methods such as polymerase assembly multiplexing, or PAM. This method has resulted in the synthesis, amplification and expression of a 14.5 kb, 21-gene 30S ribosomal protein operon from E. coli.

Minimizing Errors

One major roadblock remaining in synthetic gene synthesis is overcoming natural error rates in DNA synthesis chemistry. Phosphoramidite oligo synthesis currently has an error rate of 1 in ~160. This results in a high enough error rate in assembled genes that screening and sequencing are major cost bottlenecks, as is the repair or re-synthesis of mutated genes. While the development of cheap and high-throughput sequencing technologies will streamline the screening process, oligo synthesis error rates must also be reduced.

Currently, oligo pools are often screened against complementary reference libraries that can be reused. Oligos that do not completely base-pair with their designed sequence are either washed away in purification steps or are cleaved by DNA mismatch repair proteins due to their imperfect base-pairing. This has currently decreased error rates to 1 in 106, on the order of error rates in DNA polymerases.

Genome Synthesis

J. Craig Venter's team at JCVI carried out the largest gene synthesis currently reported through the creation of a completely synthetic genome. Initially, the JCVI team synthesized a 582,970-base pair "minimized" genome of Mycoplasma genitalium using a stepwise assembly [29]. The genome was divided into 24 cassettes of about 24 kb each which were synthesized by commercial manufacturers. These overlapping cassettes were enzymatically stitched together into bacterial artificial chromosomes, which were attached in a stepwise fashion using a combination of restriction digestion and in vitro recombination. At the point where the molecules were "quarter genomes", they were transformed into yeast where in vivo recombination machinery assembled the full M. genitalium chromosome. This was only proof of synthesis, however, and not of a synthetic genome useful for supporting a living organism.

In 2010, the JCVI team reported the creation of a bacterium controlled by a fully synthetic genome [37]. The 1.08 mega-base pair genome of Mycoplasma mycoides was synthesized and transplanted into a M. capricolum recipient cell to create a synthetic bacterium nicknamed "Synthia". The genome was divided into 1,078 1-kb cassettes, which were synthesized by Blue Heron. These cassettes were assembled in yeast using in vivo homologous recombination using a three-step hierarchy that built fragments of ~10,000 bp, ~100,000 bp and finally the complete 1.08 mega-bp genome. These were then isolated carefully from contaminating yeast DNA and transplanted into the recipient bacteria. A selectable marker placed in the synthetic genome demonstrated that bacteria that grew after transplantation were controlled by the synthetic genome. The same team demonstrated that a semi-synthetic genome could be assembled and transplanted using a combination of natural and synthetic DNA.

Final assembly of large double-stranded DNA products into kilobase and megabase functional molecules, free of errors and with minimal cost remains a process bottleneck for gene and genome synthesis. In vitro recombination methods have been successfully used to assemble large DNA molecules on the order of tens of kilobases, but this method remains prone to errors and requires considerable homology be engineered into assembly fragments.

See Also

References

  1. Khorana HG, Agarwal KL, Büchi H, Caruthers MH, Gupta NK, Kleppe K, Kumar A, Otsuka E, RajBhandary UL, Van de Sande JH, Sgaramella V, Terao T, Weber H, and Yamada T. . pmid:4571075. PubMed HubMed [Khorana1972]
  2. Itakura K, Hirose T, Crea R, Riggs AD, Heyneker HL, Bolivar F, and Boyer HW. . pmid:412251. PubMed HubMed [Itakura1977]
  3. Goeddel DV, Kleid DG, Bolivar F, Heyneker HL, Yansura DG, Crea R, Hirose T, Kraszewski A, Itakura K, and Riggs AD. . pmid:85300. PubMed HubMed [Goeddell1979]
  4. Gibson DG, Young L, Chuang RY, Venter JC, Hutchison CA 3rd, and Smith HO. . pmid:19363495. PubMed HubMed [Gibson2009]
    oligonucleotide assembly in vitro

  5. Hughes RA, Miklos AE, and Ellington AD. . pmid:21601682. PubMed HubMed [Hughes2011]
    Gene Synthesis Review

  6. Czar MJ, Anderson JC, Bader JS, and Peccoud J. . pmid:19111926. PubMed HubMed [Czar2009]
    review

  7. Engler C, Kandzia R, and Marillonnet S. . pmid:18985154. PubMed HubMed [Engler2008]
    GoldenGate

  8. Weber E, Engler C, Gruetzner R, Werner S, and Marillonnet S. . pmid:21364738. PubMed HubMed [Weber2011]
    MoClo

  9. Werner S, Engler C, Weber E, Gruetzner R, and Marillonnet S. . pmid:22126803. PubMed HubMed [Werner2012]
    MoClo

  10. Sarrion-Perdigones A, Falconi EE, Zandalinas SI, Juárez P, Fernández-del-Carmen A, Granell A, and Orzaez D. . pmid:21750718. PubMed HubMed [SarrionPerdigones2011]
    GoldenBraid

  11. Holton TA and Graham MW. . pmid:2020554. PubMed HubMed [Holton1991]
    TA cloning

  12. Shuman S. . pmid:1658796. PubMed HubMed [Shuman1991]
    TOPO

  13. Stemmer WP, Crameri A, Ha KD, Brennan TM, and Heyneker HL. . pmid:7590320. PubMed HubMed [Stemmer1995]
    PCA

  14. Aslanidis C and de Jong PJ. . pmid:2235490. PubMed HubMed [Aslanidis1990]
    LIC

  15. Aslanidis C, de Jong PJ, and Schmitz G. . pmid:7580902. PubMed HubMed [Aslanidis1994]
    LIC

  16. Li C and Evans RM. . pmid:9321675. PubMed HubMed [Li1997]
    LIC

  17. Geu-Flores F, Nour-Eldin HH, Nielsen MT, and Halkier BA. . pmid:17389646. PubMed HubMed [GeuFlores2007]
    USER

  18. Higuchi R, Krummel B, and Saiki RK. . pmid:3045756. PubMed HubMed [Higuchi1988]
    SOE-PCR

  19. Horton RM, Cai ZL, Ho SN, and Pease LR. . pmid:2357375. PubMed HubMed [Horton2009]
    SOEing

  20. Li MZ and Elledge SJ. . pmid:17293868. PubMed HubMed [Li2007]
    SLIC

  21. Li MZ and Elledge SJ. . pmid:22328425. PubMed HubMed [Li2012]
    SLIC

  22. Zhu B, Cai G, Hall EO, and Freeman GJ. . pmid:17907578. PubMed HubMed [Zhu2007]
    in-fusion

  23. Benoit RM, Wilhelm RN, Scherer-Becker D, and Ostermeier C. . pmid:16289702. PubMed HubMed [Benoit2006]
    in-fusion

  24. Sleight SC, Bartley BA, Lieviant JA, and Sauro HM. . pmid:20385581. PubMed HubMed [Sleight2010]
    In-Fusion biobrick

  25. Hartley JL, Temple GF, and Brasch MA. . pmid:11076863. PubMed HubMed [Hartley2000]
    Gateway lambda Int

  26. Khalil AM, Julius JA, and Bachant J. . pmid:17702758. PubMed HubMed [Khalil2007]
    Gateway lambda Cre

  27. Quan J and Tian J. . pmid:19649325. PubMed HubMed [Quan2009]
    CPEC

    //T5 exonuclease recombination

  28. Zhang Y, Werling U, and Edelmann W. . pmid:22241772. PubMed HubMed [Zhang2012]
    SLiCe

  29. Gibson DG, Benders GA, Andrews-Pfannkoch C, Denisova EA, Baden-Tillson H, Zaveri J, Stockwell TB, Brownley A, Thomas DW, Algire MA, Merryman C, Young L, Noskov VN, Glass JI, Venter JC, Hutchison CA 3rd, and Smith HO. . pmid:18218864. PubMed HubMed [Gibson2008]
    Complete genome synthesis

  30. Gibson DG. . pmid:19745056. PubMed HubMed [Gibson2009b]
    oligonucleotide assembly in yeast

  31. Gibson DG. . pmid:21601685. PubMed HubMed [Gibson2011]
    MIE paper

  32. Gibson DG, Smith HO, Hutchison CA 3rd, Venter JC, and Merryman C. . pmid:20935651. PubMed HubMed [Gibson2010b]
  33. Li MZ and Elledge SJ. . pmid:15731760. PubMed HubMed [Li2005]
    MAGIC, bacterial mating approach

  34. Zhang Y, Buchholz F, Muyrers JP, and Stewart AF. . pmid:9771703. PubMed HubMed [Zhang1998]
    ET recombination

  35. Larionov V, Kouprina N, Graves J, Chen XN, Korenberg JR, and Resnick MA. . pmid:8552668. PubMed HubMed [Larionov1996]
    Transformation-associated recombination (TAR) cloning

  36. Tian J, Gong H, Sheng N, Zhou X, Gulari E, Gao X, and Church G. . pmid:15616567. PubMed HubMed [Tian2004]
  37. Gibson DG, Glass JI, Lartigue C, Noskov VN, Chuang RY, Algire MA, Benders GA, Montague MG, Ma L, Moodie MM, Merryman C, Vashee S, Krishnakumar R, Assad-Garcia N, Andrews-Pfannkoch C, Denisova EA, Young L, Qi ZQ, Segall-Shapiro TH, Calvey CH, Parmar PP, Hutchison CA 3rd, Smith HO, and Venter JC. . pmid:20488990. PubMed HubMed [Gibson2010]
    genome replacement

  38. j5 DNA Assembly Design Automation Software doi: 10.1021/sb2000116 [Hillson2012]
  39. Dymond JS, Richardson SM, Coombes CE, Babatz T, Muller H, Annaluru N, Blake WJ, Schwerzmann JW, Dai J, Lindstrom DL, Boeke AC, Gottschling DE, Chandrasegaran S, Bader JS, and Boeke JD. . pmid:21918511. PubMed HubMed [Dymond2011]
All Medline abstracts: PubMed HubMed
Personal tools