» Articles » PMID: 30423090

CNEFinder: Finding Conserved Non-coding Elements in Genomes

Overview
Journal Bioinformatics
Specialty Biology
Date 2018 Nov 14
PMID 30423090
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Conserved non-coding elements (CNEs) represent an enigmatic class of genomic elements which, despite being extremely conserved across evolution, do not encode for proteins. Their functions are still largely unknown. Thus, there exists a need to systematically investigate their roles in genomes. Towards this direction, identifying sets of CNEs in a wide range of organisms is an important first step. Currently, there are no tools published in the literature for systematically identifying CNEs in genomes.

Results: We fill this gap by presenting CNEFinder; a tool for identifying CNEs between two given DNA sequences with user-defined criteria. The results presented here show the tool's ability of identifying CNEs accurately and efficiently. CNEFinder is based on a k-mer technique for computing maximal exact matches. The tool thus does not require or compute whole-genome alignments or indexes, such as the suffix array or the Burrows Wheeler Transform (BWT), which makes it flexible to use on a wide scale.

Availability And Implementation: Free software under the terms of the GNU GPL (https://github.com/lorrainea/CNEFinder).

Citing Articles

Horizontal transfers between fungal Fusarium species contributed to successive outbreaks of coffee wilt disease.

Peck L, Llewellyn T, Bennetot B, ODonnell S, Nowell R, Ryan M PLoS Biol. 2024; 22(12):e3002480.

PMID: 39637834 PMC: 11620798. DOI: 10.1371/journal.pbio.3002480.


Giant transposons promote strain heterogeneity in a major fungal pathogen.

Gluck-Thaler E, Forsythe A, Puerner C, Stajich J, Croll D, Cramer R bioRxiv. 2024; .

PMID: 38979181 PMC: 11230402. DOI: 10.1101/2024.06.28.601215.


A survey of k-mer methods and applications in bioinformatics.

Moeckel C, Mareboina M, Konnaris M, Chan C, Mouratidis I, Montgomery A Comput Struct Biotechnol J. 2024; 23:2289-2303.

PMID: 38840832 PMC: 11152613. DOI: 10.1016/j.csbj.2024.05.025.


Systematic identification of cargo-mobilizing genetic elements reveals new dimensions of eukaryotic diversity.

Gluck-Thaler E, Vogan A Nucleic Acids Res. 2024; 52(10):5496-5513.

PMID: 38686785 PMC: 11162782. DOI: 10.1093/nar/gkae327.


Pan-evolutionary and regulatory genome architecture delineated by an integrated macro- and microsynteny approach.

Yu H, Li Y, Han W, Bao L, Liu F, Ma Y Nat Protoc. 2024; 19(6):1623-1678.

PMID: 38514839 DOI: 10.1038/s41596-024-00966-4.


References
1.
Sandelin A, Alkema W, Engstrom P, Wasserman W, Lenhard B . JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res. 2003; 32(Database issue):D91-4. PMC: 308747. DOI: 10.1093/nar/gkh012. View

2.
Schwartz S, Kent W, Smit A, Zhang Z, Baertsch R, Hardison R . Human-mouse alignments with BLASTZ. Genome Res. 2003; 13(1):103-7. PMC: 430961. DOI: 10.1101/gr.809403. View

3.
Dimitrieva S, Bucher P . UCNEbase--a database of ultraconserved non-coding elements and genomic regulatory blocks. Nucleic Acids Res. 2012; 41(Database issue):D101-9. PMC: 3531063. DOI: 10.1093/nar/gks1092. View

4.
Khiste N, Ilie L . E-MEM: efficient computation of maximal exact matches for very large genomes. Bioinformatics. 2014; 31(4):509-14. DOI: 10.1093/bioinformatics/btu687. View

5.
Woolfe A, Goode D, Cooke J, Callaway H, Smith S, Snell P . CONDOR: a database resource of developmentally associated conserved non-coding elements. BMC Dev Biol. 2007; 7:100. PMC: 2020477. DOI: 10.1186/1471-213X-7-100. View