VCGDB: a Dynamic Genome Database of the Chinese Population

Overview
Journal BMC Genomics
Publisher BioMed Central
Specialty Genetics
Date 2014 Apr 9
PMID 24708222
Citations 6
Abstract

Background: The data released by the 1000 Genomes Project contain an increasing number of genome sequences from different nations and populations, with a large number of genetic variations. As a result, the focus of human genome studies is changing from single and static to complex and dynamic. The currently available human reference genome (GRCh37) is based on sequencing data from 13 anonymous Caucasian volunteers, which might limit the scope of genomics, transcriptomics, epigenetics, and genome-wide association studies.

Description: We used the large volume of sequencing data published by the 1000 Genomes Project Consortium to construct the Virtual Chinese Genome Database (VCGDB), a dynamic genome database of the Chinese population based on the whole-genome sequencing data of 194 individuals. VCGDB provides dynamic genomic information, comprising 35 million single nucleotide variations (SNVs), 0.5 million insertions/deletions (indels), and 29 million rare variations, together with genomic annotation information. VCGDB also provides a highly interactive, user-friendly virtual Chinese genome browser (VCGBrowser) with functions such as seamless zooming and real-time searching. In addition, we have established three population-specific consensus Chinese reference genomes that are compatible with mainstream alignment software.
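
The abstract does not say how the consensus reference genomes were derived. As an illustration only, the sketch below shows one common way a population consensus might be built: start from an existing reference sequence and substitute the alternate base wherever its allele frequency in the population exceeds one half. The function name `build_consensus`, the record layout, and the 0.5 threshold are all assumptions made for this example, not details taken from VCGDB, and indels are ignored for simplicity.

```python
from typing import Dict, Tuple

# Hypothetical SNV record: (reference base, alternate base, alternate allele frequency).
Snv = Tuple[str, str, float]


def build_consensus(reference: str, snvs: Dict[int, Snv], threshold: float = 0.5) -> str:
    """Apply major-allele SNVs to a reference sequence.

    `snvs` maps a 1-based position to an SNV record. The alternate base
    replaces the reference base only when its frequency exceeds `threshold`.
    """
    bases = list(reference)
    for pos, (ref_base, alt_base, alt_freq) in snvs.items():
        # Guard against records that do not match the supplied reference.
        if bases[pos - 1] != ref_base:
            raise ValueError(f"reference mismatch at position {pos}")
        if alt_freq > threshold:
            bases[pos - 1] = alt_base
    return "".join(bases)


# Toy data, invented for the example.
reference = "ACGTACGT"
snvs = {
    2: ("C", "T", 0.8),  # alternate allele is the majority: substituted
    5: ("A", "G", 0.1),  # rare variant: reference base kept
    8: ("T", "A", 0.6),  # alternate allele is the majority: substituted
}
print(build_consensus(reference, snvs))  # ATGTACGA
```

Because only single-base substitutions are applied, the result has the same length and coordinate system as the input reference, so it can be written out as FASTA and indexed by an aligner in the usual way.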

Conclusions: VCGDB offers a feasible strategy for processing big data to keep pace with the biological data explosion by providing a robust resource for genomics studies; in particular, studies aimed at finding regions of the genome associated with diseases.

Citing Articles

T2T-YAO, T2T-SHUN, and more.

Xiao J, Yu J Genomics Proteomics Bioinformatics. 2023; 21(6):1081-1082.

PMID: 37742994 PMC: 11082254. DOI: 10.1016/j.gpb.2023.09.002.


-associated syndrome caused by a novel mutation in a Chinese boy: A case report and literature review.

Zhu Y, Sun G, Yang Z World J Clin Cases. 2021; 9(21):6081-6090.

PMID: 34368330 PMC: 8316932. DOI: 10.12998/wjcc.v9.i21.6081.


RGAAT: A Reference-based Genome Assembly and Annotation Tool for New Genomes and Upgrade of Known Genomes.

Liu W, Wu S, Lin Q, Gao S, Ding F, Zhang X Genomics Proteomics Bioinformatics. 2018; 16(5):373-381.

PMID: 30583062 PMC: 6364042. DOI: 10.1016/j.gpb.2018.03.006.


The BIG Data Center: from deposition to integration to translation.

Nucleic Acids Res. 2016; 45(D1):D18-D24.

PMID: 27899658 PMC: 5210546. DOI: 10.1093/nar/gkw1060.


Precision Medicine: What Do We Expect in the Scope of Basic Biomedical Sciences?.

Yu J Genomics Proteomics Bioinformatics. 2016; 14(1):1-3.

PMID: 26883672 PMC: 4792840. DOI: 10.1016/j.gpb.2016.02.001.

