A clean or minimal genome refers to the minimum set of genes that an organism needs to survive and reproduce. This implies that there are genes that are “nonessential” to the organism’s survival and can be removed without destroying the cell or disrupting its growth cycle. Examples of nonessential DNA would include duplicate genes, transposable elements and catabolic pathways used for the intake and breakdown of certain complex biomolecules and would be removed from a minimal genome.
A reduced genome is..............................
Advantages of a Minimal Genome
As the complexity of synthetic biology projects increases, so too will the problems it will run into due to the somewhat inherent randomness inside each cell. A minimal genome would give synthetic biologists a reliable and predictable “chassis” that could provide an ideal platform for perusing new research.
Current model organisms used in modern research contain deleterious DNA and gene products (like insertion sequences and unforeseen molecular interactions) which could disable or prohibit future endeavors by interacting unfavorably with whatever the “subject of interest” is. Smaller amounts of extraneous gene products would simplify and reduce the cost of extraction and purification of cell parts, biomolecules and pharmaceuticals, many of which would be more expensive and time consuming to separate from a population of cells by conventional means.
Gene stability in reduced genomes has been improved by removing transposable elements (TEs), error prone DNA polymerases and the enzymes responsible for the SOS response. Spontaneous, random genetic changes would be extremely harmful toward a minimal cell lacking a number of redundant systems. A stable genome would be a very desirable trait for research and experiment replication.
Future advances in genome replication, unnatural amino acids, drug development, fuel production and biomaterial synthesis could be accelerated by simplifying and streamlining the genome used to code for a cell.
Minimal/Clean Genomes vs. Wild Type Genomes vs. Reduced Genomes
Estimating the Number of Essential Genes
There are several different methods used for estimating the minimum number of essential genes an organism needs to survive in a controlled environment. Each method has its own shortcomings which limits their applications.
Comparative genomics looks for genomic homology between different organisms. Genetic homology over a wide number of similar organisms could be an indicator of essential genes since they were conserved throughout those strains or species. Unfortunately, a comparative approach could underestimate the number of essential genes since in only accounts for true genetic orthologs. For example, this gene estimate would not account for genes with different morphologies that code for functionally similar gene products. In some instances, it could also underestimate the number of essential genes since homologous genes do not have to be useful or essential. For example, certain virulence factors that are homologous in many pathogenic microbes are not essential genes. 
Gene Disruption using Transposable Mutagenesis
Targeted gene disruption using Transposable mutagenesis involves attempting to inactivate genes using a large number of transposable elements, then sequencing the resulting genome. Theoretically, if transposable elements are unable to insert themselves into a gene, then those genes must be more essential to the cell than other genes that are susceptible to disruption. Some of the genes screened may read a false positive for essentiality since there is the chance that some transducable genes may not have been transduced. Also, one transposon may disable multiple genes (like in alternatively spliced genes). An essential gene could also function normally with a transposable element inside it.
mRNA Disruption using Antisense RNA
Antisense RNA (asRNA) is a single strand of RNA that complementary to an mRNA inside a cell. When antisense RNA base pairs with mRNA, the mRNA is unable to be translated. Cells that can't survive in the presence of certain asRNAs indicate that those gene products are essential to cellular survival.
One approach to creating more reliable, efficient host organisms for synthetic constructs is the reduction of the genome to eliminate extraneous genes, mutagenic mobile elements, and other unnecessary or destabilizing factors. This can be viewed as a form of reverse engineering of extant strains.
Systematic Genome Reduction
One natural approach to engineering strains with a reduced genome is to systematically identify and delete regions of the genome not necessary for host cell survival. Posfai et al. created the MDS strains (multiple deletion strains) by aligning the genomes of multiple genomes of E. coli, identifying regions which were absent in multiple strains, and deleting them via Lambda Red recombination. All IS elements were removed as well, lowering the mutation rate and increasing the stability of genetic constructs introduced into the cell. The strain had comparable growth rate compared to wild type.
Ara et al. constructed a minimal version of the B. subtilis genome in 2007.  This strain had slightly decreased growth rate compared to wild-type, but displayed normal morphology and similar protein production capabilities.
Selection for Reduced Genome
Long-term evolution of strains under the correct conditions could select for a genome of minimal size. Such conditions may include growth in rich media lacking sugars to favor the loss of biosynthetic pathways or sugar metabolism operons, growth in structured environments which favor a smaller cell, or growth under other conditions which favor the loss of unnecessary genes.
Minimal Genome Synthesis
Another approach is the synthesis of a minimal, designed genome from scratch using DNA synthesis technology and the transformation of this genome into cells to create a viable, novel, synthetic organism. This approach can be viewed as forward engineering of a novel organism, but would likely be informed by studies which determine the minimal set of genes necessary for a living organism.
Mycoplasma mycoides Synthesis
Gibson et al synthesized the first artificial cell by generating the Mycoplasma mycoides genome from digitized genome information and transforming it into Mycoplasma capricolum cells devoid of genomic information. These cells were capable of continuous self-replication and were identified by "watermarks" inserted in the genome. This technology could be utilized in the future to create cells with novel and useful properties from scratch.
Error fetching PMID 18218864:
Error fetching PMID 20488990:
Error fetching PMID 11932248:
Error fetching PMID 16645050:
Error fetching PMID 16924266:
Error fetching PMID 10591650:
Error fetching PMID 20638265:
Error fetching PMID 11932248:
Error fetching PMID 21658106:
Error fetching PMID 12682299:
Error fetching PMID 20093288:
- Error fetching PMID 16924266:
- Error fetching PMID 10591650:
- Error fetching PMID 20638265:
- Error fetching PMID 11932248:
- Error fetching PMID 11932248:
- Error fetching PMID 21658106:
- Error fetching PMID 12682299:
- Error fetching PMID 20093288:
- C. M. Trepod,J. E. Mott. Elucidation of Essential and Nonessential Genes in the Haemophilus influenzae Rd Cell Wall Biosynthetic Pathway by Targeted Gene Disruption Antimicrob Agents Chemother. 2005 February; 49(2): 824–826
- Error fetching PMID 16645050:
- Error fetching PMID 17115975:
- Error fetching PMID 18218864:
- Error fetching PMID 20488990: