» Articles » PMID: 25009556

A Consensus Approach to Vertebrate De Novo Transcriptome Assembly from RNA-seq Data: Assembly of the Duck (Anas Platyrhynchos) Transcriptome

Overview
Journal Front Genet
Date 2014 Jul 11
PMID 25009556
Citations 25
Authors
Affiliations
Soon will be listed here.
Abstract

For vertebrate organisms where a reference genome is not available, de novo transcriptome assembly enables a cost effective insight into the identification of tissue specific or differentially expressed genes and variation of the coding part of the genome. However, since there are a number of different tools and parameters that can be used to reconstruct transcripts, it is difficult to determine an optimal method. Here we suggest a pipeline based on (1) assessing the performance of three different assembly tools (2) using both single and multiple k-mer (MK) approaches (3) examining the influence of the number of reads used in the assembly (4) merging assemblies from different tools. We use an example dataset from the vertebrate Anas platyrhynchos domestica (Pekin duck). We find that taking a subset of data enables a robust assembly to be produced by multiple methods without the need for very high memory capacity. The use of reads mapped back to transcripts (RMBT) and CEGMA (Core Eukaryotic Genes Mapping Approach) provides useful metrics to determine the completeness of assembly obtained. For this dataset the use of MK in the assembly generated a more complete assembly as measured by greater number of RMBT and CEGMA score. Merged single k-mer assemblies are generally smaller but consist of longer transcripts, suggesting an assembly consisting of fewer fragmented transcripts. We suggest that the use of a subset of reads during assembly allows the relatively rapid investigation of assembly characteristics and can guide the user to the most appropriate transcriptome for particular downstream use. Transcriptomes generated by the compared assembly methods and the final merged assembly are freely available for download at http://dx.doi.org/10.6084/m9.figshare.1032613.

Citing Articles

Alkaloid production and response to natural adverse conditions in : transcriptome analyses.

Jazayeri S, Pooralinaghi M, Torres-Navarrete Y, Oviedo-Bayas B, Guerra I, Jacome D BioTechnologia (Pozn). 2023; 103(4):355-384.

PMID: 36685700 PMC: 9837557. DOI: 10.5114/bta.2022.120706.


Normalized Workflow to Optimize Hybrid De Novo Transcriptome Assembly for Non-Model Species: A Case Study in (Baker) Boiss.

Sheikh-Assadi M, Naderi R, Salami S, Kafi M, Fatahi R, Shariati V Plants (Basel). 2022; 11(18).

PMID: 36145766 PMC: 9503428. DOI: 10.3390/plants11182365.


Computational screening of miRNAs and their targets in saffron (Crocus sativus L.) by transcriptome mining.

Taheri-Dehkordi A, Naderi R, Martinelli F, Salami S Planta. 2021; 254(6):117.

PMID: 34751821 DOI: 10.1007/s00425-021-03761-7.


Metatranscriptomic Analysis of Bacterial Communities on Laundered Textiles: A Pilot Case Study.

Jacksch S, Konig C, Kaiser D, Weide M, Ratering S, Schnell S Microorganisms. 2021; 9(8).

PMID: 34442670 PMC: 8400938. DOI: 10.3390/microorganisms9081591.


Development of a relevant strategy using de novo transcriptome assembly method for transcriptome comparisons between Muscovy and common duck species and their reciprocal inter-specific mule and hinny hybrids fed ad libitum and overfed.

Liu X, Herault F, Diot C, Corre E BMC Genomics. 2020; 21(1):687.

PMID: 33008290 PMC: 7531116. DOI: 10.1186/s12864-020-07099-4.


References
1.
Zhao Q, Wang Y, Kong Y, Luo D, Li X, Hao P . Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study. BMC Bioinformatics. 2012; 12 Suppl 14:S2. PMC: 3287467. DOI: 10.1186/1471-2105-12-S14-S2. View

2.
Grabherr M, Haas B, Yassour M, Levin J, Thompson D, Amit I . Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011; 29(7):644-52. PMC: 3571712. DOI: 10.1038/nbt.1883. View

3.
Mundry M, Bornberg-Bauer E, Sammeth M, Feulner P . Evaluating characteristics of de novo assembly software on 454 transcriptome data: a simulation approach. PLoS One. 2012; 7(2):e31410. PMC: 3288049. DOI: 10.1371/journal.pone.0031410. View

4.
Kumar S, Blaxter M . Comparing de novo assemblers for 454 transcriptome data. BMC Genomics. 2010; 11:571. PMC: 3091720. DOI: 10.1186/1471-2164-11-571. View

5.
Zerbino D, Birney E . Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008; 18(5):821-9. PMC: 2336801. DOI: 10.1101/gr.074492.107. View