» Articles » PMID: 33414550

Efficient Phasing and Imputation of Low-coverage Sequencing Data Using Large Reference Panels

Overview
Journal Nat Genet
Specialty Genetics
Date 2021 Jan 8
PMID 33414550
Citations 145
Authors
Affiliations
Soon will be listed here.
Abstract

Low-coverage whole-genome sequencing followed by imputation has been proposed as a cost-effective genotyping approach for disease and population genetics studies. However, its competitiveness against SNP arrays is undermined because current imputation methods are computationally expensive and unable to leverage large reference panels. Here, we describe a method, GLIMPSE, for phasing and imputation of low-coverage sequencing datasets from modern reference panels. We demonstrate its remarkable performance across different coverages and human populations. GLIMPSE achieves imputation of a genome for less than US$1 in computational cost, considerably outperforming other methods and improving imputation accuracy over the full allele frequency range. As a proof of concept, we show that 1× coverage enables effective gene expression association studies and outperforms dense SNP arrays in rare variant burden tests. Overall, this study illustrates the promising potential of low-coverage imputation and suggests a paradigm shift in the design of future genomic studies.

Citing Articles

High continuity of forager ancestry in the Neolithic period of the eastern Maghreb.

Lipson M, Ringbauer H, Lucarini G, Aouadi N, Aoudia L, Belhouchet L Nature. 2025; .

PMID: 40074896 DOI: 10.1038/s41586-025-08699-4.


Using genotype imputation to integrate Canola populations for genome-wide association and genomic prediction of blackleg resistance.

Zhao H, MacLeod I, Keeble-Gagnere G, Barbulescu D, Tibbits J, Kaur S BMC Genomics. 2025; 26(1):215.

PMID: 40038585 PMC: 11877698. DOI: 10.1186/s12864-025-11250-4.


Ancient genomes reveal trans-Eurasian connections between the European Huns and the Xiongnu Empire.

Gnecchi-Ruscone G, Racz Z, Liccardo S, Lee J, Huang Y, Traverso L Proc Natl Acad Sci U S A. 2025; 122(9):e2418485122.

PMID: 39993190 PMC: 11892651. DOI: 10.1073/pnas.2418485122.


Multi-omics analysis in primary T cells elucidates mechanisms behind disease-associated genetic loci.

Shi C, Zhao D, Butler J, Frantzeskos A, Rossi S, Ding J Genome Biol. 2025; 26(1):26.

PMID: 39930543 PMC: 11808986. DOI: 10.1186/s13059-025-03492-y.


Genomic Landscape and Prediction of Udder Traits in Saanen Dairy Goats.

Yao X, Li J, Fu J, Wang X, Ma L, Nanaei H Animals (Basel). 2025; 15(2).

PMID: 39858261 PMC: 11759135. DOI: 10.3390/ani15020261.


References
1.
Brody J, Morrison A, Bis J, OConnell J, Brown M, Huffman J . Analysis commons, a team approach to discovery in a big-data environment for genetic epidemiology. Nat Genet. 2017; 49(11):1560-1563. PMC: 5720686. DOI: 10.1038/ng.3968. View

2.
Buerkle C, Gompert Z . Population genomics based on low coverage sequencing: how low should we go?. Mol Ecol. 2012; 22(11):3028-35. DOI: 10.1111/mec.12105. View

3.
Le S, Durbin R . SNP detection and genotyping from low-coverage sequencing data on multiple diploid samples. Genome Res. 2010; 21(6):952-60. PMC: 3106328. DOI: 10.1101/gr.113084.110. View

4.
Pasaniuc B, Rohland N, McLaren P, Garimella K, Zaitlen N, Li H . Extremely low-coverage sequencing and imputation increases power for genome-wide association studies. Nat Genet. 2012; 44(6):631-5. PMC: 3400344. DOI: 10.1038/ng.2283. View

5.
Gilly A, Ritchie G, Southam L, Farmaki A, Tsafantakis E, Dedoussis G . Very low-depth sequencing in a founder population identifies a cardioprotective APOC3 signal missed by genome-wide imputation. Hum Mol Genet. 2016; 25(11):2360-2365. PMC: 5081052. DOI: 10.1093/hmg/ddw088. View