» Articles » PMID: 18987735

The Diploid Genome Sequence of an Asian Individual

Abstract

Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.

Citing Articles

Efficient identification of genomic insertions and surrounding regions in two transgenic maize events using third-generation single-molecule nanopore sequencing technology.

Liu Q, Wang Q, Ning L, Chen Z, Zhang C, Liu Y Sci Rep. 2024; 14(1):31921.

PMID: 39738762 PMC: 11685737. DOI: 10.1038/s41598-024-83403-6.


Population genomics advances in frontier ethnic minorities in China.

Chen H, Xu S Sci China Life Sci. 2024; .

PMID: 39643831 DOI: 10.1007/s11427-024-2659-2.


Review of the technology used for structural characterization of the GMO genome using NGS data.

Moon K, Basnet P, Um T, Choi I Genomics Inform. 2024; 22(1):14.

PMID: 39358775 PMC: 11445869. DOI: 10.1186/s44342-024-00016-1.


Haplotype-resolved Chinese male genome assembly based on high-fidelity sequencing.

Yang X, Zhao X, Qu S, Jia P, Wang B, Gao S Fundam Res. 2024; 2(6):946-953.

PMID: 38933383 PMC: 11197534. DOI: 10.1016/j.fmre.2022.02.005.


Genome-Wide Identification of Specific Genetic Loci Common to Sheep and Goat.

Liang Z, Yue X, Liu Y, Ye M, Zhong L, Luan Y Biomolecules. 2024; 14(6).

PMID: 38927042 PMC: 11201639. DOI: 10.3390/biom14060638.


References
1.
Levy S, Sutton G, Ng P, Feuk L, Halpern A, Walenz B . The diploid genome sequence of an individual human. PLoS Biol. 2007; 5(10):e254. PMC: 1964779. DOI: 10.1371/journal.pbio.0050254. View

2.
Zerbino D, Birney E . Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 2008; 18(5):821-9. PMC: 2336801. DOI: 10.1101/gr.074492.107. View

3.
Coon K, Myers A, Craig D, Webster J, Pearson J, Lince D . A high-density whole-genome association study reveals that APOE is the major susceptibility gene for sporadic late-onset Alzheimer's disease. J Clin Psychiatry. 2007; 68(4):613-8. DOI: 10.4088/jcp.v68n0419. View

4.
Kidd J, Cooper G, Donahue W, Hayden H, Sampas N, Graves T . Mapping and sequencing of structural variation from eight human genomes. Nature. 2008; 453(7191):56-64. PMC: 2424287. DOI: 10.1038/nature06862. View

5.
Sherry S, Ward M, Kholodov M, Baker J, Phan L, Smigielski E . dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2000; 29(1):308-11. PMC: 29783. DOI: 10.1093/nar/29.1.308. View