» Articles » PMID: 21935354

Phased Whole-genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence

Abstract

Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (< 1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.

Citing Articles

The complete and fully-phased diploid genome of a male Han Chinese.

Yang C, Zhou Y, Song Y, Wu D, Zeng Y, Nie L Cell Res. 2023; 33(10):745-761.

PMID: 37452091 PMC: 10542383. DOI: 10.1038/s41422-023-00849-5.


Variant calling and benchmarking in an era of complete human genome sequences.

Olson N, Wagner J, Dwarshuis N, Miga K, Sedlazeck F, Salit M Nat Rev Genet. 2023; 24(7):464-483.

PMID: 37059810 DOI: 10.1038/s41576-023-00590-0.


Whole Exome Sequencing in South Africa: Stakeholder Views on Return of Individual Research Results and Incidental Findings.

Van Der Merwe N, Ramesar R, de Vries J Front Genet. 2022; 13:864822.

PMID: 35754817 PMC: 9216214. DOI: 10.3389/fgene.2022.864822.


Pan-human consensus genome significantly improves the accuracy of RNA-seq analyses.

Kaminow B, Ballouz S, Gillis J, Dobin A Genome Res. 2022; 32(4):738-749.

PMID: 35256454 PMC: 8997357. DOI: 10.1101/gr.275613.121.


A guide for the diagnosis of rare and undiagnosed disease: beyond the exome.

Marwaha S, Knowles J, Ashley E Genome Med. 2022; 14(1):23.

PMID: 35220969 PMC: 8883622. DOI: 10.1186/s13073-022-01026-w.


References
1.
Degner J, Marioni J, Pai A, Pickrell J, Nkadori E, Gilad Y . Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics. 2009; 25(24):3207-12. PMC: 2788925. DOI: 10.1093/bioinformatics/btp579. View

2.
Kimura M . Evolutionary rate at the molecular level. Nature. 1968; 217(5129):624-6. DOI: 10.1038/217624a0. View

3.
Bell C, Dinwiddie D, Miller N, Hateley S, Ganusova E, Mudge J . Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med. 2011; 3(65):65ra4. PMC: 3740116. DOI: 10.1126/scitranslmed.3001756. View

4.
Nelson M, Bryc K, King K, Indap A, Boyko A, Novembre J . The Population Reference Sample, POPRES: a resource for population, disease, and pharmacological genetics research. Am J Hum Genet. 2008; 83(3):347-58. PMC: 2556436. DOI: 10.1016/j.ajhg.2008.08.005. View

5.
Scott S, Sangkuhl K, Gardner E, Stein C, Hulot J, Johnson J . Clinical Pharmacogenetics Implementation Consortium guidelines for cytochrome P450-2C19 (CYP2C19) genotype and clopidogrel therapy. Clin Pharmacol Ther. 2011; 90(2):328-32. PMC: 3234301. DOI: 10.1038/clpt.2011.132. View