Phased Whole-genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence
Overview
Authors
Affiliations
Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (< 1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.
The complete and fully-phased diploid genome of a male Han Chinese.
Yang C, Zhou Y, Song Y, Wu D, Zeng Y, Nie L Cell Res. 2023; 33(10):745-761.
PMID: 37452091 PMC: 10542383. DOI: 10.1038/s41422-023-00849-5.
Variant calling and benchmarking in an era of complete human genome sequences.
Olson N, Wagner J, Dwarshuis N, Miga K, Sedlazeck F, Salit M Nat Rev Genet. 2023; 24(7):464-483.
PMID: 37059810 DOI: 10.1038/s41576-023-00590-0.
Van Der Merwe N, Ramesar R, de Vries J Front Genet. 2022; 13:864822.
PMID: 35754817 PMC: 9216214. DOI: 10.3389/fgene.2022.864822.
Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses.
Kaminow B, Ballouz S, Gillis J, Dobin A Genome Res. 2022; 32(4):738-749.
PMID: 35256454 PMC: 8997357. DOI: 10.1101/gr.275613.121.
A guide for the diagnosis of rare and undiagnosed disease: beyond the exome.
Marwaha S, Knowles J, Ashley E Genome Med. 2022; 14(1):23.
PMID: 35220969 PMC: 8883622. DOI: 10.1186/s13073-022-01026-w.