» Articles » PMID: 37646508

Machine Learning-based Detection of Adventitious Microbes in T-cell Therapy Cultures Using Long-read Sequencing

Abstract

Assuring that cell therapy products are safe before releasing them for use in patients is critical. Currently, compendial sterility testing for bacteria and fungi can take 7-14 days. The goal of this work was to develop a rapid untargeted approach for the sensitive detection of microbial contaminants at low abundance from low volume samples during the manufacturing process of cell therapies. We developed a long-read sequencing methodology using Oxford Nanopore Technologies MinION platform with 16S and 18S amplicon sequencing to detect USP <71> organisms and other microbial species. Reads are classified metagenomically to predict the microbial species. We used an extreme gradient boosting machine learning algorithm (XGBoost) to first assess if a sample is contaminated, and second, determine whether the predicted contaminant is correctly classified or misclassified. The model was used to make a final decision on the sterility status of the input sample. An optimized experimental and bioinformatics pipeline starting from spiked species through to sequenced reads allowed for the detection of microbial samples at 10 colony-forming units (CFU)/mL using metagenomic classification. Machine learning can be coupled with long-read sequencing to detect and identify sample sterility status and microbial species present in T-cell cultures, including the USP <71> organisms to 10 CFU/mL. IMPORTANCE This research presents a novel method for rapidly and accurately detecting microbial contaminants in cell therapy products, which is essential for ensuring patient safety. Traditional testing methods are time-consuming, taking 7-14 days, while our approach can significantly reduce this time. By combining advanced long-read nanopore sequencing techniques and machine learning, we can effectively identify the presence and types of microbial contaminants at low abundance levels. This breakthrough has the potential to improve the safety and efficiency of cell therapy manufacturing, leading to better patient outcomes and a more streamlined production process.

Citing Articles

Acute ischemic stroke prediction and predictive factors analysis using hematological indicators in elderly hypertensives post-transient ischemic attack.

Shu C, Zheng C, Luo D, Song J, Jiang Z, Ge L Sci Rep. 2024; 14(1):695.

PMID: 38184714 PMC: 10771433. DOI: 10.1038/s41598-024-51402-2.

References
1.
Rodriguez-Perez H, Ciuffreda L, Flores C . NanoCLUST: a species-level analysis of 16S rRNA nanopore sequencing data. Bioinformatics. 2020; 37(11):1600-1601. DOI: 10.1093/bioinformatics/btaa900. View

2.
Nath S, Harper L, Rancourt D . Cell-Based Therapy Manufacturing in Stirred Suspension Bioreactor: Thoughts for cGMP Compliance. Front Bioeng Biotechnol. 2020; 8:599674. PMC: 7726241. DOI: 10.3389/fbioe.2020.599674. View

3.
Daly G, Leggett R, Rowe W, Stubbs S, Wilkinson M, Ramirez-Gonzalez R . Host Subtraction, Filtering and Assembly Validations for Novel Viral Discovery Using Next Generation Sequencing Data. PLoS One. 2015; 10(6):e0129059. PMC: 4476701. DOI: 10.1371/journal.pone.0129059. View

4.
Marti J . Recentrifuge: Robust comparative analysis and contamination removal for metagenomics. PLoS Comput Biol. 2019; 15(4):e1006967. PMC: 6472834. DOI: 10.1371/journal.pcbi.1006967. View

5.
Sanderson N, Street T, Foster D, Swann J, Atkins B, Brent A . Real-time analysis of nanopore-based metagenomic sequencing from infected orthopaedic devices. BMC Genomics. 2018; 19(1):714. PMC: 6161345. DOI: 10.1186/s12864-018-5094-y. View