» Articles » PMID: 39729312

GeniePool 2.0: Advancing Variant Analysis Through CHM13-T2T, AlphaMissense, GnomAD V4 Integration, and Variant Co-occurrence Queries

Overview
Specialty Biology
Date 2024 Dec 27
PMID 39729312
Authors
Affiliations
Soon will be listed here.
Abstract

Originally developed to meet the challenges of genomic data deluge, GeniePool emerged as a pioneering platform, enabling efficient storage, accessibility, and analysis of vast genomic datasets, enabled due to its data lake architecture. Building on this foundation, GeniePool 2.0 advances genomic analysis through the integration of cutting-edge variant databases, such as CHM13-T2T, AlphaMissense, and gnomAD V4, coupled with the capability for variant co-occurrence queries. This evolution offers an unprecedented level of granularity and scope in genomic analyses, from enhancing our understanding of variant pathogenicity and phenotypic associations to facilitating research collaborations. The introduction of CHM13-T2T provides a more accurate reference for human genetic variation, AlphaMissense enriches the platform with protein-level impact predictions of missense mutations, and gnomAD V4 offers a comprehensive view of human genetic diversity. Additionally, the innovative feature for variant co-occurrence analysis is pivotal for exploring the combined effects of genetic variations, advancing our comprehension of compound heterozygosity, epistasis, and polygenic risk factors in disease pathogenesis. GeniePool 2.0 is a comprehensive and scalable platform, which aims to enhance genomic data analysis and contribute to genomic research, potentially supporting new discoveries and clinical innovations. Database URL: https://GeniePool.link.

References
1.
Guo M, Francioli L, Stenton S, Goodrich J, Watts N, Singer-Berk M . Inferring compound heterozygosity from large-scale exome sequencing data. Nat Genet. 2023; 56(1):152-161. PMC: 10872287. DOI: 10.1038/s41588-023-01608-3. View

2.
Nurk S, Koren S, Rhie A, Rautiainen M, Bzikadze A, Mikheenko A . The complete sequence of a human genome. Science. 2022; 376(6588):44-53. PMC: 9186530. DOI: 10.1126/science.abj6987. View

3.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A . The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20(9):1297-303. PMC: 2928508. DOI: 10.1101/gr.107524.110. View

4.
Su Z, Wang Z, Ni X, Duan J, Gao Y, Zhuo M . Inferring the Evolution and Progression of Small-Cell Lung Cancer by Single-Cell Sequencing of Circulating Tumor Cells. Clin Cancer Res. 2019; 25(16):5049-5060. DOI: 10.1158/1078-0432.CCR-18-3571. View

5.
Krushkal J, Zhao Y, Hose C, Monks A, Doroshow J, Simon R . Longitudinal Transcriptional Response of Glycosylation-Related Genes, Regulators, and Targets in Cancer Cell Lines Treated With 11 Antitumor Agents. Cancer Inform. 2017; 16:1176935117747259. PMC: 5734428. DOI: 10.1177/1176935117747259. View