» Articles » PMID: 34551715

Unveiling the Transcriptomic Complexity of Miscanthus Sinensis Using a Combination of PacBio Long Read- and Illumina Short Read Sequencing Platforms

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2021 Sep 23
PMID 34551715
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Miscanthus sinensis Andersson is a perennial grass that exhibits remarkable lignocellulose characteristics suitable for sustainable bioenergy production. However, knowledge of the genetic resources of this species is relatively limited, which considerably hampers further work on its biology and genetic improvement.

Results: In this study, through analyzing the transcriptome of mixed samples of leaves and stems using the latest PacBio Iso-Seq sequencing technology combined with Illumina HiSeq, we report the first full-length transcriptome dataset of M. sinensis with a total of 58.21 Gb clean data. An average of 15.75 Gb clean reads of each sample were obtained from the PacBio Iso-Seq system, which doubled the data size (6.68 Gb) obtained from the Illumina HiSeq platform. The integrated analyses of PacBio- and Illumina-based transcriptomic data uncovered 408,801 non-redundant transcripts with an average length of 1,685 bp. Of those, 189,406 transcripts were commonly identified by both methods, 169,149 transcripts with an average length of 619 bp were uniquely identified by Illumina HiSeq, and 51,246 transcripts with an average length of 2,535 bp were uniquely identified by PacBio Iso-Seq. Approximately 96 % of the final combined transcripts were mapped back to the Miscanthus genome, reflecting the high quality and coverage of our sequencing results. When comparing our data with genomes of four species of Andropogoneae, M. sinensis showed the closest relationship with sugarcane with up to 93 % mapping ratios, followed by sorghum with up to 80 % mapping ratios, indicating a high conservation of orthologs in these three genomes. Furthermore, 306,228 transcripts were successfully annotated against public databases including cell wall related genes and transcript factor families, thus providing many new insights into gene functions. The PacBio Iso-Seq data also helped identify 3,898 alternative splicing events and 2,963 annotated AS isoforms within 10 function categories.

Conclusions: Taken together, the present study provides a rich data set of full-length transcripts that greatly enriches our understanding of M. sinensis transcriptomic resources, thus facilitating further genetic improvement and molecular studies of the Miscanthus species.

Citing Articles

Full-Length Transcriptome Characterization and Functional Analysis of Pathogenesis-Related Proteins in Oriental Hybrid 'Sorbonne' Infected with .

Du W, Chai N, Sun Z, Wang H, Liu S, Sui S Int J Mol Sci. 2023; 24(1).

PMID: 36613869 PMC: 9820132. DOI: 10.3390/ijms24010425.


Integrated Full-Length Transcriptome and MicroRNA Sequencing Approaches Provide Insights Into Salt Tolerance in Mangrove ( Buch.-Ham.).

Chen B, Ding Z, Zhou X, Wang Y, Huang F, Sun J Front Genet. 2022; 13:932832.

PMID: 35899202 PMC: 9310009. DOI: 10.3389/fgene.2022.932832.

References
1.
Fu L, Niu B, Zhu Z, Wu S, Li W . CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012; 28(23):3150-2. PMC: 3516142. DOI: 10.1093/bioinformatics/bts565. View

2.
Marquez Y, Brown J, Simpson C, Barta A, Kalyna M . Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res. 2012; 22(6):1184-95. PMC: 3371709. DOI: 10.1101/gr.134106.111. View

3.
Harris M, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R . The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2003; 32(Database issue):D258-61. PMC: 308770. DOI: 10.1093/nar/gkh036. View

4.
Bolger M, Arsova B, Usadel B . Plant genome and transcriptome annotations: from misconceptions to simple solutions. Brief Bioinform. 2017; 19(3):437-449. PMC: 5952960. DOI: 10.1093/bib/bbw135. View

5.
Simao F, Waterhouse R, Ioannidis P, Kriventseva E, Zdobnov E . BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015; 31(19):3210-2. DOI: 10.1093/bioinformatics/btv351. View