» Articles » PMID: 36227538

Design and Analysis of Massively Parallel Reporter Assays Using FORECAST

Overview
Specialty Molecular Biology
Date 2022 Oct 13
PMID 36227538
Authors
Affiliations
Soon will be listed here.
Abstract

Machine learning is revolutionizing molecular biology and bioengineering by providing powerful insights and predictions. Massively parallel reporter assays (MPRAs) have emerged as a particularly valuable class of high-throughput technique to support such algorithms. MPRAs enable the simultaneous characterization of thousands or even millions of genetic constructs and provide the large amounts of data needed to train models. However, while the scale of this approach is impressive, the design of effective MPRA experiments is challenging due to the many factors that can be varied and the difficulty in predicting how these will impact the quality and quantity of data obtained. Here, we present a computational tool called FORECAST, which can simulate MPRA experiments based on fluorescence-activated cell sorting and subsequent sequencing (commonly referred to as Flow-seq or Sort-seq experiments), as well as carry out rigorous statistical estimation of construct performance from this type of experimental data. FORECAST can be used to develop workflows to aid the design of MPRA experiments and reanalyze existing MPRA data sets.

Citing Articles

Data hazards in synthetic biology.

Zelenka N, Di Cara N, Sharma K, Sarvaharman S, Ghataora J, Parmeggiani F Synth Biol (Oxf). 2024; 9(1):ysae010.

PMID: 38973982 PMC: 11227101. DOI: 10.1093/synbio/ysae010.


Transfer learning for cross-context prediction of protein expression from 5'UTR sequence.

Gilliot P, Gorochowski T Nucleic Acids Res. 2024; 52(13):e58.

PMID: 38864396 PMC: 11260469. DOI: 10.1093/nar/gkae491.


Applications of artificial intelligence and machine learning in dynamic pathway engineering.

Merzbacher C, Oyarzun D Biochem Soc Trans. 2023; 51(5):1871-1879.

PMID: 37656433 PMC: 10657174. DOI: 10.1042/BST20221542.

References
1.
Nielsen A, Der B, Shin J, Vaidyanathan P, Paralanov V, Strychalski E . Genetic circuit design automation. Science. 2016; 352(6281):aac7341. DOI: 10.1126/science.aac7341. View

2.
Brophy J, Voigt C . Principles of genetic circuit design. Nat Methods. 2014; 11(5):508-20. PMC: 4230274. DOI: 10.1038/nmeth.2926. View

3.
Ellis T, Wang X, Collins J . Diversity-based, model-guided construction of synthetic gene networks with predicted functions. Nat Biotechnol. 2009; 27(5):465-71. PMC: 2680460. DOI: 10.1038/nbt.1536. View

4.
Ajo-Franklin C, Drubin D, Eskin J, Gee E, Landgraf D, Phillips I . Rational design of memory in eukaryotic cells. Genes Dev. 2007; 21(18):2271-6. PMC: 1973140. DOI: 10.1101/gad.1586107. View

5.
Zong Y, Zhang H, Lyu C, Ji X, Hou J, Guo X . Insulated transcriptional elements enable precise design of genetic circuits. Nat Commun. 2017; 8(1):52. PMC: 5495784. DOI: 10.1038/s41467-017-00063-z. View