Total Synthesis of Long DNA Sequences: Synthesis of a Contiguous 32-kb Polyketide Synthase Gene Cluster
Overview
To exploit the huge potential of whole-genome sequence information, the ability to efficiently synthesize long, accurate DNA sequences is becoming increasingly important. An approach proposed toward this end involves the synthesis of approximately 5-kb segments of DNA, followed by their assembly into longer sequences by conventional cloning methods [Smith, H. O., Hutchison, C. A., III, Pfannkoch, C. & Venter, J. C. (2003) Proc. Natl. Acad. Sci. USA 100, 15440-15445]. The major current impediment to the success of this tactic is the difficulty of building the approximately 5-kb components accurately, efficiently, and rapidly from short synthetic oligonucleotide building blocks. We have developed and implemented a strategy for the high-throughput synthesis of long, accurate DNA sequences. Unpurified 40-base synthetic oligonucleotides are built into 500- to 800-bp "synthons" with low error frequency by automated PCR-based gene synthesis. By parallel processing, these synthons are efficiently joined into multisynthon approximately 5-kb segments by using only three endonucleases and "ligation by selection." These large segments can be subsequently assembled into very long sequences by conventional cloning. We validated the approach by building a synthetic 31,656-bp polyketide synthase gene cluster whose functionality was demonstrated by its ability to produce the megaenzyme and its polyketide product in Escherichia coli.
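To give a feel for the scale of the hierarchy described above (40-base oligonucleotides, 500- to 800-bp synthons, approximately 5-kb segments, one 31,656-bp cluster), the following is a rough back-of-the-envelope sketch. The sizes are taken from the abstract; the tiling scheme (both strands fully covered by 40-mers, no overlap between adjacent synthons or segments) and the helper names `count_units` and `oligos_per_synthon` are illustrative assumptions, not the authors' actual design, so the real counts may differ.

```python
import math

# Sizes stated in the abstract.
TARGET_BP = 31_656        # length of the synthetic gene cluster
OLIGO_LEN = 40            # length of each synthetic oligonucleotide
SYNTHON_BP = (500, 800)   # synthon size range
SEGMENT_BP = 5_000        # approximate multisynthon segment size


def count_units(total_bp: int, unit_bp: int) -> int:
    """Units of size unit_bp needed to cover total_bp.

    Assumption: units abut with no overlap or flanking sequence.
    """
    return math.ceil(total_bp / unit_bp)


def oligos_per_synthon(synthon_bp: int, oligo_len: int = OLIGO_LEN) -> int:
    """Oligonucleotides needed for one synthon.

    Assumption: both strands are tiled end to end with oligo_len-mers,
    which is a common layout for PCR-based gene synthesis but is not
    specified in the abstract.
    """
    return 2 * math.ceil(synthon_bp / oligo_len)


segments = count_units(TARGET_BP, SEGMENT_BP)
synthons_min = count_units(TARGET_BP, SYNTHON_BP[1])  # all synthons at 800 bp
synthons_max = count_units(TARGET_BP, SYNTHON_BP[0])  # all synthons at 500 bp

if __name__ == "__main__":
    print(f"segments of ~{SEGMENT_BP} bp: {segments}")
    print(f"synthons: between {synthons_min} and {synthons_max}")
    print(f"oligos per 500-bp synthon: {oligos_per_synthon(500)}")
    print(f"oligos per 800-bp synthon: {oligos_per_synthon(800)}")
```

Under these assumptions the estimate is only an order-of-magnitude guide: it ignores restriction sites, overlaps, and any sequence added for cloning, all of which would change the totals.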