» Articles » PMID: 21653522

The Variant Call Format and VCFtools

Overview
Journal Bioinformatics
Specialty Biology
Date 2011 Jun 10
PMID 21653522
Citations 6422
Authors
Affiliations
Soon will be listed here.
Abstract

Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API.

Availability: http://vcftools.sourceforge.net

Citing Articles

Next-generation sequencing-based population genetics unravels the evolutionary history of Rhodomyrtus tomentosa in China.

Xu X, Liao B, Liao S, Qin Q, He C, Ding X BMC Plant Biol. 2025; 25(1):338.

PMID: 40089704 DOI: 10.1186/s12870-025-06364-6.


Hordeum I genome unlocks adaptive evolution and genetic potential for crop improvement.

Feng H, Du Q, Jiang Y, Jia Y, He T, Wang Y Nat Plants. 2025; .

PMID: 40087544 DOI: 10.1038/s41477-025-01942-w.


Flax domesticationprocesses as inferred from genome-wide SNP data.

Fu Y Sci Rep. 2025; 15(1):8731.

PMID: 40082459 PMC: 11906640. DOI: 10.1038/s41598-025-89498-9.


Analysis of Population Structure and Selective Signatures for Milk Production Traits in Xinjiang Brown Cattle and Chinese Simmental Cattle.

Ma K, Li X, Ma S, Zhang M, Wang D, Xu L Int J Mol Sci. 2025; 26(5).

PMID: 40076627 PMC: 11900343. DOI: 10.3390/ijms26052003.


Genomic Insights into the Population Genetics and Adaptive Evolution of Yellow Seabream () with Whole-Genome Resequencing.

Li Y, Yang J, Fang Y, Zhang R, Cai Z, Shan B Animals (Basel). 2025; 15(5).

PMID: 40076030 PMC: 11898413. DOI: 10.3390/ani15050745.


References
1.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N . The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25(16):2078-9. PMC: 2723002. DOI: 10.1093/bioinformatics/btp352. View

2.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A . The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20(9):1297-303. PMC: 2928508. DOI: 10.1101/gr.107524.110. View

3.
Reese M, Moore B, Batchelor C, Salas F, Cunningham F, Marth G . A standard variation file format for human genome sequences. Genome Biol. 2010; 11(8):R88. PMC: 2945790. DOI: 10.1186/gb-2010-11-8-r88. View

4.
Abecasis G, Altshuler D, Auton A, Brooks L, Durbin R, Gibbs R . A map of human genome variation from population-scale sequencing. Nature. 2010; 467(7319):1061-73. PMC: 3042601. DOI: 10.1038/nature09534. View