Fast-Find: a Novel Computational Approach to Analyzing Combinatorial Motifs

Overview
Publisher Biomed Central
Specialty Biology
Date 2006 Jan 6
PMID 16393334
Citations 4
Abstract

Background: Many vital biological processes, including transcription and splicing, require a combination of short, degenerate sequence patterns, or motifs, adjacent to defined sequence features. Although these motifs occur frequently by chance, they only have biological meaning within a specific context. Identifying transcripts that contain meaningful combinations of patterns is thus an important problem, which existing tools address poorly.

Results: Here we present a new approach, Fast-FIND (Fast-Fully Indexed Nucleotide Database), that uses a relational database to support rapid indexed searches for arbitrary combinations of patterns defined either by sequence or composition. Fast-FIND is easy to implement, takes less than a second to search the entire Drosophila genome sequence for arbitrary patterns adjacent to sites of alternative polyadenylation, and is sufficiently fast to allow sensitivity analysis on the patterns. We have applied this approach to identify transcripts that contain combinations of sequence motifs for RNA-binding proteins that may regulate alternative polyadenylation.

Conclusion: Fast-FIND provides an efficient way to identify transcripts that are potentially regulated via alternative polyadenylation. We have used it to generate hypotheses about interactions between specific polyadenylation factors, which we will test experimentally.
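
The indexed-search idea described in the Results can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the schema, word length, or query form, so the table `kmer`, the word length `K`, and the functions `build_index` and `find_combination` are all hypothetical. The sketch uses SQLite to store every fixed-length word with its position, indexes the table by word, and then asks which sites have two given words, in order, within a window downstream. It covers only patterns defined by sequence, not by composition.

```python
import sqlite3

K = 4  # hypothetical word length; the abstract does not specify one


def build_index(sequences, k=K):
    """Store every k-mer occurrence of each sequence in an indexed SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE kmer (seq_id TEXT, pos INTEGER, word TEXT)")
    rows = (
        (seq_id, pos, seq[pos:pos + k])
        for seq_id, seq in sequences.items()
        for pos in range(len(seq) - k + 1)
    )
    conn.executemany("INSERT INTO kmer VALUES (?, ?, ?)", rows)
    # The index lets lookups by word avoid scanning the whole table.
    conn.execute("CREATE INDEX kmer_word ON kmer (word, seq_id, pos)")
    return conn


def find_combination(conn, sites, first, second, window):
    """Return the (seq_id, site) pairs where both words start within `window`
    nucleotides downstream of the site and `first` starts before `second`.
    Both words must have the length used to build the index."""
    query = """
        SELECT 1
        FROM kmer AS a JOIN kmer AS b ON a.seq_id = b.seq_id
        WHERE a.word = ? AND b.word = ? AND a.seq_id = ?
          AND a.pos BETWEEN ? AND ?
          AND b.pos BETWEEN ? AND ?
          AND a.pos < b.pos
        LIMIT 1
    """
    hits = []
    for seq_id, site in sites:
        end = site + window
        params = (first, second, seq_id, site, end, site, end)
        if conn.execute(query, params).fetchone():
            hits.append((seq_id, site))
    return hits


# Toy data, invented for illustration only.
sequences = {
    "tx1": "CCAATAAACCCATGTACCTTTTCC",  # TGTA then TTTT after the site
    "tx2": "CCAATAAACCCATGTACCCCCCCC",  # TGTA only
    "tx3": "CCAATAAACCCTTTTCCTGTACCC",  # both words, but in the wrong order
}
sites = [("tx1", 10), ("tx2", 10), ("tx3", 10)]

conn = build_index(sequences)
hits = find_combination(conn, sites, "TGTA", "TTTT", window=10)
print(hits)
```

Because each query is a pair of indexed lookups, changing the window or swapping one word for a variant is cheap to rerun, which is the property the abstract relies on for sensitivity analysis. Degenerate patterns would need an expansion into concrete words or a different table design, which this sketch leaves out.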
