PMID: 21572440

Full-length Transcriptome Assembly from RNA-Seq Data Without a Reference Genome

Abstract

Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse, and whitefly, the last of which has no reference genome yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.
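As a rough illustration of the de Bruijn graph idea mentioned in the abstract, the following minimal sketch builds a graph from a few toy reads and extends a single unbranched path into a contig. It is not Trinity's implementation: the function names (`build_de_bruijn`, `extend_unbranched`), the toy reads, and the choice of k = 4 are all assumptions made for illustration, and real assemblers additionally handle sequencing errors, coverage, reverse complements, and branching caused by isoforms or paralogs.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read.

    Hypothetical helper: every k-mer in a read contributes one edge from its
    (k-1)-length prefix to its (k-1)-length suffix.
    """
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def extend_unbranched(graph, start):
    """Walk forward from `start` while each node has exactly one successor.

    Stops at a dead end, at a branch point, or when a node repeats (a cycle).
    """
    contig = start
    node = start
    seen = {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:
            break
        contig += nxt[-1]  # each step adds one new base
        seen.add(nxt)
        node = nxt
    return contig


# Toy overlapping reads (assumed data, not from the paper).
reads = ["ATGGCGT", "GGCGTGC", "GTGCAAT"]
graph = build_de_bruijn(reads, k=4)
contig = extend_unbranched(graph, "ATG")
print(contig)
```

In this toy case the three reads overlap cleanly, so the walk never meets a branch. In transcriptome data, branch points are where alternative isoforms or recently duplicated genes diverge, which is why a simple linear walk like this one is only a starting point.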

Citing Articles

Contributions of interspecific hybrids to genetic variability in Glycyrrhiza uralensis and G. glabra.

Kim J, Lee J, Kang J, Shim H, Kang D, Lee S. Sci Rep. 2025; 15(1):8764.

PMID: 40082484 PMC: 11906797. DOI: 10.1038/s41598-025-92115-4.


Unveiling the genetic diversity of the genera Enamovirus and Polerovirus through data-driven virus discovery.

Sidharthan V, Reddy V, Krishnan N, Parameswari B. Arch Virol. 2025; 170(4):76.

PMID: 40080166 DOI: 10.1007/s00705-025-06258-w.


Integrated Analysis of Transcriptomics and Proteomics Provides Insights into the Accumulation Mechanism of Ascorbic Acid in Tratt.

Li P, Mu B, Liu J, Wu W, He C, Tan B. Foods. 2025; 14(5).

PMID: 40077452 PMC: 11899413. DOI: 10.3390/foods14050748.


The Aldehyde Dehydrogenase Superfamily in L.: Genome-Wide Identification and Expression Analysis Under Low-Temperature Conditions.

Jin T, Wu C, Huang Z, Zhang X, Li S, Ding C. Int J Mol Sci. 2025; 26(5).

PMID: 40076992 PMC: 11901046. DOI: 10.3390/ijms26052373.


Chromosome-level genome assembly of a specialist walnut pest Atrijuglans aristata.

Feng D, Sun C, Li Y, Gao Q, Wang G, Li H. Sci Data. 2025; 12(1):434.

PMID: 40075062 PMC: 11904212. DOI: 10.1038/s41597-025-04754-x.

