» Articles » PMID: 15601529

Gametic Phase Estimation over Large Genomic Regions Using an Adaptive Window Approach

Overview
Journal Hum Genomics
Publisher Biomed Central
Specialty Genetics
Date 2004 Dec 17
PMID 15601529
Citations 39
Authors
Affiliations
Soon will be listed here.
Abstract

The authors present ELB, an easy to programme and computationally fast algorithm for inferring gametic phase in population samples of multilocus genotypes. Phase updates are made on the basis of a window of neighbouring loci, and the window size varies according to the local level of linkage disequilibrium. Thus, ELB is particularly well suited to problems involving many loci and/or relatively large genomic regions, including those with variable recombination rate. The authors have simulated population samples of single nucleotide polymorphism genotypes with varying levels of recombination and marker density, and find that ELB provides better local estimation of gametic phase than the PHASE or HTYPER programs, while its global accuracy is broadly similar. The relative improvement in local accuracy increases both with increasing recombination and with increasing marker density. Short tandem repeat (STR, or microsatellite) simulation studies demonstrate ELB's superiority over PHASE both globally and locally. Missing data are handled by ELB; simulations show that phase recovery is virtually unaffected by up to 2 per cent of missing data, but that phase estimation is noticeably impaired beyond this amount. The authors also applied ELB to datasets obtained from random pairings of 42 human X chromosomes typed at 97 diallelic markers in a 200 kb low-recombination region. Once again, they found ELB to have consistently better local accuracy than PHASE or HTYPER, while its global accuracy was close to the best.

Citing Articles

Genetic hypervariability of a Northeastern Atlantic venomous rockfish.

Francisco S, Castilho R, Lima C, Almada F, Rodrigues F, Sanda R PeerJ. 2021; 9:e11730.

PMID: 34306828 PMC: 8280884. DOI: 10.7717/peerj.11730.


Low expression alleles and tuberculosis in HIV infected South Africans.

Reid D, Shenoi S, Singh R, Wang M, Patel V, Das R Cytokine X. 2021; 1(1):100004.

PMID: 33604547 PMC: 7885893. DOI: 10.1016/j.cytox.2019.100004.


Contrasting population structure and demographic history of cereal aphids in different environmental and agricultural landscapes.

Morales-Hojas R, Sun J, Iraizoz F, Tan X, Chen J Ecol Evol. 2020; 10(18):9647-9662.

PMID: 33005337 PMC: 7520199. DOI: 10.1002/ece3.6565.


Major Histocompatibility Complex Class I Chain-Related A and B (MICA and MICB) Gene, Allele, and Haplotype Associations With Dengue Infections in Ethnic Thais.

Luangtrakool P, Vejbaesya S, Luangtrakool K, Ngamhawornwong S, Apisawes K, Kalayanarooj S J Infect Dis. 2020; 222(5):840-846.

PMID: 32737971 PMC: 7399699. DOI: 10.1093/infdis/jiaa134.


Time matters: genetic composition and evaluation of effective population size in temperate coastal fish species.

Francisco S, Robalo J PeerJ. 2020; 8:e9098.

PMID: 32391212 PMC: 7197400. DOI: 10.7717/peerj.9098.


References
1.
Tishkoff S, Pakstis A, Ruano G, Kidd K . The accuracy of statistical methods for estimation of haplotype frequencies: an example from the CD4 locus. Am J Hum Genet. 2000; 67(2):518-22. PMC: 1287198. DOI: 10.1086/303000. View

2.
Fallin D, Schork N . Accuracy of haplotype frequency estimation for biallelic loci, via the expectation-maximization algorithm for unphased diploid genotype data. Am J Hum Genet. 2000; 67(4):947-59. PMC: 1287896. DOI: 10.1086/303069. View

3.
Nickerson D, Taylor S, Fullerton S, Weiss K, Clark A, Stengard J . Sequence diversity and large-scale typing of SNPs in the human apolipoprotein E gene. Genome Res. 2000; 10(10):1532-45. PMC: 310963. DOI: 10.1101/gr.146900. View

4.
Spiegelman J, Mindrinos M, Oefner P . High-accuracy DNA sequence variation screening by DHPLC. Biotechniques. 2000; 29(5):1084-90, 1092. DOI: 10.2144/00295rr04. View

5.
Excoffier L, Novembre J, Schneider S . SIMCOAL: a general coalescent program for the simulation of molecular data in interconnected populations with arbitrary demography. J Hered. 2001; 91(6):506-9. DOI: 10.1093/jhered/91.6.506. View