
Illumina Synthetic Long Read Sequencing Allows Recovery of Missing Sequences Even in the "Finished" C. Elegans Genome

Overview
Journal Sci Rep
Specialty Science
Date 2015 Jun 4
PMID 26039588
Citations 31
Abstract

Most next-generation sequencing platforms permit acquisition of high-throughput DNA sequences, but their relatively short read length limits their use in genome assembly or finishing. Illumina has recently released a technology called Synthetic Long-Read Sequencing that can produce reads of unusual length, i.e., predominantly around 10 Kbp. However, a systematic assessment of their use in genome finishing and assembly is still lacking. We evaluate the promise and deficiencies of the long reads in these respects using the isogenic C. elegans genome, which has no gaps. First, the reads are highly accurate and capable of recovering most types of repetitive sequences. However, the presence of tandem repetitive sequences prevents pre-assembly of long reads in the relevant genomic regions. Second, the reads can reliably detect missing but not extra sequences in the C. elegans genome. Third, smaller reads are more capable of recovering repetitive sequences than larger ones. Fourth, at least 40 Kbp of missing genomic sequence is recovered in the C. elegans genome using the long reads. Finally, an N50 contig size of at least 86 Kbp can be achieved with 24× reads, but with substantial mis-assembly errors, highlighting the need for novel assembly algorithms for the long reads.
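For readers unfamiliar with the N50 statistic quoted in the abstract, the following is a minimal sketch of how it is conventionally computed: sort the contigs from longest to shortest and report the length of the contig at which the running total first reaches half of the assembled bases. The function name `n50` and the contig lengths in the example are illustrative assumptions, not data or code from the paper.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly.

    N50 is the length L such that contigs of length >= L together
    contain at least half of all assembled bases.
    """
    lengths = sorted(contig_lengths, reverse=True)
    if not lengths:
        raise ValueError("at least one contig length is required")

    half_total = sum(lengths) / 2
    running = 0
    for length in lengths:
        running += length
        # The first contig that pushes the running sum to half of the
        # assembly defines the N50.
        if running >= half_total:
            return length


# Hypothetical contig lengths in Kbp (not taken from the paper).
example_contigs = [100, 80, 60, 40, 20]
print(n50(example_contigs))
```

Because N50 depends only on contig lengths, it says nothing about whether the contigs are correctly joined, which is why the abstract pairs the 86 Kbp figure with a caveat about mis-assembly errors.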

Citing Articles

CGC1, a new reference genome for .

Ichikawa K, Shoura M, Artiles K, Jeong D, Owa C, Kobayashi H bioRxiv. 2024.

PMID: 39677790 PMC: 11643116. DOI: 10.1101/2024.12.04.626850.


Isolation, molecular identification, and genomic analysis of strain ASIOC01 from activated sludge harboring the bioremediation prowess of glycerol and organic pollutants in high-salinity.

Chin H, Ravi Varadharajulu N, Lin Z, Chen W, Zhang Z, Arumugam S Front Microbiol. 2024; 15:1415723.

PMID: 38983623 PMC: 11231211. DOI: 10.3389/fmicb.2024.1415723.


The Application of Metagenomics to Study Microbial Communities and Develop Desirable Traits in Fermented Foods.

Srinivas M, OSullivan O, Cotter P, Van Sinderen D, Kenny J Foods. 2023; 11(20).

PMID: 37431045 PMC: 9601669. DOI: 10.3390/foods11203297.


Genetic exchange with an outcrossing sister species causes severe genome-wide dysregulation in a selfing nematode.

Xie D, Ye P, Ma Y, Li Y, Liu X, Sarkies P Genome Res. 2022; 32(11-12):2015-2027.

PMID: 36351773 PMC: 9808620. DOI: 10.1101/gr.277205.122.


Genomic architecture of 5S rDNA cluster and its variations within and between species.

Ding Q, Li R, Ren X, Chan L, Ho V, Xie D BMC Genomics. 2022; 23(1):238.

PMID: 35346033 PMC: 8961926. DOI: 10.1186/s12864-022-08476-x.

