Finding Motifs in the Twilight Zone
Motivation: Gene activity is often affected by the binding of transcription factors to short fragments in DNA sequences called motifs. Identifying subtle regulatory motifs in a DNA sequence is a difficult pattern recognition problem. In this paper we design a new motif finding algorithm that can detect very subtle motifs.
Results: We introduce the notion of a multiprofile and use it for finding subtle motifs in DNA sequences. Multiprofiles generalize the notion of a profile and allow one to detect subtle patterns that escape detection by the standard profiles. Our MULTIPROFILER algorithm outperforms other leading motif finding algorithms in a number of synthetic models. Moreover, it can be shown that in some previously studied motif models, MULTIPROFILER is capable of pushing the performance envelope to its theoretical limits.
Availability: http://www-cse.ucsd.edu/groups/bioinformatics/software.html
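As background for the "standard profiles" mentioned in the Results, the following minimal sketch shows a conventional profile: a table of per-position nucleotide counts built from aligned motif instances, and the consensus string read off from it. It is not the MULTIPROFILER algorithm, and the abstract does not define the multiprofile itself; the function names and the example instances are illustrative assumptions.

```python
from collections import Counter

ALPHABET = "ACGT"


def build_profile(instances):
    """Return per-position nucleotide counts for equal-length motif instances.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if not instances:
        raise ValueError("at least one motif instance is required")
    length = len(instances[0])
    if any(len(s) != length for s in instances):
        raise ValueError("all motif instances must have the same length")

    profile = []
    # zip(*instances) yields one tuple of letters per alignment column.
    for column in zip(*instances):
        counts = Counter(column)
        profile.append({base: counts.get(base, 0) for base in ALPHABET})
    return profile


def consensus(profile):
    """Pick the most frequent nucleotide in each column (ties go to the
    earlier letter in ALPHABET)."""
    return "".join(max(ALPHABET, key=lambda b: col[b]) for col in profile)


if __name__ == "__main__":
    # Made-up instances, chosen only to show the data structure.
    instances = ["ACGT", "ACGA", "TCGT"]
    profile = build_profile(instances)
    for position, column in enumerate(profile):
        print(position, column)
    print("consensus:", consensus(profile))
```

A subtle motif is one whose instances each differ from the consensus at several positions, so the columns of a profile built from only a few instances carry a weak signal; this is the setting the abstract refers to.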