FASTAptamer: A Bioinformatic Toolkit for High-throughput Sequence Analysis of Combinatorial Selections
Overview
Affiliations
High-throughput sequence (HTS) analysis of combinatorial selection populations accelerates lead discovery and optimization and offers dynamic insight into selection processes. An underlying principle is that selection enriches high-fitness sequences as a fraction of the population, whereas low-fitness sequences are depleted. HTS analysis readily provides the requisite numerical information by tracking the evolutionary trajectory of individual sequences in response to selection pressures. Unlike genomic data, for which a number of software solutions exist, user-friendly tools are not readily available for the combinatorial selections field, leading many users to create custom software. FASTAptamer was designed to address the sequence-level analysis needs of the field. The open source FASTAptamer toolkit counts, normalizes and ranks read counts in a FASTQ file, compares populations for sequence distribution, generates clusters of sequence families, calculates fold-enrichment of sequences throughout the course of a selection and searches for degenerate sequence motifs. While originally designed for aptamer selections, FASTAptamer can be applied to any selection strategy that can utilize next-generation DNA sequencing, such as ribozyme or deoxyribozyme selections, in vivo mutagenesis and various surface display technologies (peptide, antibody fragment, mRNA, etc.). FASTAptamer software, sample data and a user's guide are available for download at http://burkelab.missouri.edu/fastaptamer.html.
Carter Jr C, Carter C, Tang G, Patra S, Betts L, Dieckhaus H bioRxiv. 2025; .
PMID: 39763899 PMC: 11702779. DOI: 10.1101/2024.12.17.628912.
Wang L, Canoura J, Byrd C, Nguyen T, Alkhamis O, Ly P ACS Cent Sci. 2024; 10(12):2213-2228.
PMID: 39735321 PMC: 11672540. DOI: 10.1021/acscentsci.4c01377.
AltaiR: a C toolkit for alignment-free and temporal analysis of multi-FASTA data.
Silva J, Pinho A, Pratas D Gigascience. 2024; 13.
PMID: 39589438 PMC: 11590114. DOI: 10.1093/gigascience/giae086.
Ruiz-Ciancio D, Veeramani S, Singh R, Embree E, Ortman C, Thiel K Mol Ther Nucleic Acids. 2024; 35(4):102358.
PMID: 39507401 PMC: 11539416. DOI: 10.1016/j.omtn.2024.102358.
Gruenke P, Mayer M, Aneja R, Schulze W, Song Z, Burke D ACS Infect Dis. 2024; 10(8):2637-2655.
PMID: 39016538 PMC: 11320578. DOI: 10.1021/acsinfecdis.3c00708.