» Articles » PMID: 33288906

Fully Phased Human Genome Assembly Without Parental Data Using Single-cell Strand Sequencing and Long Reads

Abstract

Human genomes are typically assembled as consensus sequences that lack information on parental haplotypes. Here we describe a reference-free workflow for diploid de novo genome assembly that combines the chromosome-wide phasing and scaffolding capabilities of single-cell strand sequencing with continuous long-read or high-fidelity sequencing data. Employing this strategy, we produced a completely phased de novo genome assembly for each haplotype of an individual of Puerto Rican descent (HG00733) in the absence of parental data. The assemblies are accurate (quality value > 40) and highly contiguous (contig N50 > 23 Mbp) with low switch error rates (0.17%), providing fully phased single-nucleotide variants, indels and structural variants. A comparison of Oxford Nanopore Technologies and Pacific Biosciences phased assemblies identified 154 regions that are preferential sites of contig breaks, irrespective of sequencing technology or phasing algorithms.

Citing Articles

A telomere-to-telomere phased genome of an octoploid strawberry reveals a receptor kinase conferring anthracnose resistance.

Han H, Salinas N, Barbey C, Jang Y, Fan Z, Verma S Gigascience. 2025; 14.

PMID: 40072904 PMC: 11899574. DOI: 10.1093/gigascience/giaf005.


Locityper: targeted genotyping of complex polymorphic genes.

Prodanov T, Plender E, Seebohm G, Meuth S, Eichler E, Marschall T bioRxiv. 2025; .

PMID: 39990346 PMC: 11844405. DOI: 10.1101/2024.05.03.592358.


Comparisons of performances of structural variants detection algorithms in solitary or combination strategy.

Duan D, Cheng C, Huang Y, Chung A, Chen P, Chen Y PLoS One. 2025; 20(2):e0314982.

PMID: 39913463 PMC: 11801633. DOI: 10.1371/journal.pone.0314982.


Novel insight of the SVP gene involved in pedicel length based on genomics analysis in cherry.

Tan W, Zhou P, Huang X, Wang Z, Liao R, Hayat F Plant Cell Rep. 2025; 44(2):50.

PMID: 39907812 DOI: 10.1007/s00299-025-03439-4.


Highly accurate Korean draft genomes reveal structural variation highlighting human telomere evolution.

Kim J, Park J, Yang J, Kim S, Joe S, Park G Nucleic Acids Res. 2025; 53(1.

PMID: 39778865 PMC: 11707537. DOI: 10.1093/nar/gkae1294.