» Articles » PMID: 23131050

LTRsift: a Graphical User Interface for Semi-automatic Classification and Postprocessing of De Novo Detected LTR Retrotransposons

Overview
Journal Mob DNA
Publisher Biomed Central
Specialty Genetics
Date 2012 Nov 8
PMID 23131050
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

Unlabelled:

Background: Long terminal repeat (LTR) retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-)families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets), making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow.

Results: We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort.

Conclusions: LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining the output of software for predicting LTR retrotransposons up to the stage of preparing full-length reference sequence libraries. The LTRsift software is freely available at http://www.zbh.uni-hamburg.de/LTRsift under an open-source license.

Citing Articles

Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning.

Orozco-Arias S, Isaza G, Guyot R Int J Mol Sci. 2019; 20(15).

PMID: 31390781 PMC: 6696364. DOI: 10.3390/ijms20153837.


LtrDetector: A tool-suite for detecting long terminal repeat retrotransposons de-novo.

Valencia J, Girgis H BMC Genomics. 2019; 20(1):450.

PMID: 31159720 PMC: 6547461. DOI: 10.1186/s12864-019-5796-9.


Inpactor, Integrated and Parallel Analyzer and Classifier of LTR Retrotransposons and Its Application for Pineapple LTR Retrotransposons Diversity and Dynamics.

Orozco-Arias S, Liu J, Tabares-Soto R, Ceballos D, Silva Domingues D, Garavito A Biology (Basel). 2018; 7(2).

PMID: 29799487 PMC: 6022998. DOI: 10.3390/biology7020032.


A machine learning based framework to identify and classify long terminal repeat retrotransposons.

Schietgat L, Vens C, Cerri R, Fischer C, Costa E, Ramon J PLoS Comput Biol. 2018; 14(4):e1006097.

PMID: 29684010 PMC: 5933816. DOI: 10.1371/journal.pcbi.1006097.


Genome-wide analysis of transposable elements in the coffee berry borer Hypothenemus hampei (Coleoptera: Curculionidae): description of novel families.

Hernandez-Hernandez E, Fernandez-Medina R, Navarro-Escalante L, Nunez J, Benavides-Machado P, Carareto C Mol Genet Genomics. 2017; 292(3):565-583.

PMID: 28204924 DOI: 10.1007/s00438-017-1291-7.


References
1.
Gentles A, Wakefield M, Kohany O, Gu W, Batzer M, Pollock D . Evolutionary dynamics of transposable elements in the short-tailed opossum Monodelphis domestica. Genome Res. 2007; 17(7):992-1004. PMC: 1899126. DOI: 10.1101/gr.6070707. View

2.
Eilbeck K, Lewis S, Mungall C, Yandell M, Stein L, Durbin R . The Sequence Ontology: a tool for the unification of genome annotations. Genome Biol. 2005; 6(5):R44. PMC: 1175956. DOI: 10.1186/gb-2005-6-5-r44. View

3.
Kielbasa S, Wan R, Sato K, Horton P, Frith M . Adaptive seeds tame genomic sequence comparison. Genome Res. 2011; 21(3):487-93. PMC: 3044862. DOI: 10.1101/gr.113985.110. View

4.
Marquet R, Isel C, Ehresmann C, Ehresmann B . tRNAs as primer of reverse transcriptases. Biochimie. 1995; 77(1-2):113-24. DOI: 10.1016/0300-9084(96)88114-4. View

5.
Steinbiss S, Gremme G, Scharfer C, Mader M, Kurtz S . AnnotationSketch: a genome annotation drawing library. Bioinformatics. 2008; 25(4):533-4. DOI: 10.1093/bioinformatics/btn657. View