CNEFinder: Finding Conserved Non-coding Elements in Genomes
Motivation: Conserved non-coding elements (CNEs) are an enigmatic class of genomic elements that, despite being extremely conserved across evolution, do not encode proteins. Their functions are still largely unknown, so there is a need to investigate their roles in genomes systematically. Identifying sets of CNEs in a wide range of organisms is an important first step in this direction. Currently, no tools for systematically identifying CNEs in genomes have been published in the literature.
Results: We fill this gap by presenting CNEFinder, a tool for identifying CNEs between two given DNA sequences according to user-defined criteria. The results presented here show that the tool identifies CNEs accurately and efficiently. CNEFinder is based on a k-mer technique for computing maximal exact matches. It therefore neither requires nor computes whole-genome alignments or indexes, such as the suffix array or the Burrows-Wheeler transform (BWT), which makes it flexible to use on a wide scale.
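To make the idea of k-mer-based maximal exact matching concrete, the following is a minimal Python sketch. It is not CNEFinder's implementation: the function name `maximal_exact_matches` and its parameters are hypothetical, and it assumes a simple in-memory hash table of k-mers rather than whatever data structures the tool actually uses. It only illustrates how shared k-mers can serve as seeds that are extended into maximal exact matches without a suffix array or BWT index.

```python
from collections import defaultdict


def maximal_exact_matches(ref, query, k, min_len=None):
    """Return sorted (ref_start, query_start, length) maximal exact matches.

    Hypothetical helper for illustration only; not part of CNEFinder.
    Only matches of length >= min_len (default: k) are reported.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    if min_len is None:
        min_len = k

    # Hash every k-mer of the reference to the positions where it occurs.
    index = defaultdict(list)
    for i in range(len(ref) - k + 1):
        index[ref[i:i + k]].append(i)

    matches = []
    for j in range(len(query) - k + 1):
        for i in index.get(query[j:j + k], ()):
            # Skip seeds that can be extended to the left: the same match
            # is reported once, from its leftmost seed.
            if i > 0 and j > 0 and ref[i - 1] == query[j - 1]:
                continue
            # Extend the seed to the right while the characters agree.
            e, f = i + k, j + k
            while e < len(ref) and f < len(query) and ref[e] == query[f]:
                e += 1
                f += 1
            length = e - i
            if length >= min_len:
                matches.append((i, j, length))
    return sorted(matches)


if __name__ == "__main__":
    print(maximal_exact_matches("ACGTACGTTT", "TTACGTAC", k=4))
```

Exact matches found this way would only be a starting point: identifying CNEs under user-defined criteria, as described above, requires further steps that this sketch does not attempt.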
Availability And Implementation: Free software under the terms of the GNU GPL (https://github.com/lorrainea/CNEFinder).