» Articles » PMID: 38250421

Review of Computational Methods and Database Sources for Predicting the Effects of Coding Frameshift Small Insertion and Deletion Variations

Overview
Journal ACS Omega
Specialty Chemistry
Date 2024 Jan 22
PMID 38250421
Authors
Affiliations
Soon will be listed here.
Abstract

Genetic variations (including substitutions, insertions, and deletions) exert a profound influence on DNA sequences. These variations are systematically classified as synonymous, nonsynonymous, and nonsense, each manifesting distinct effects on proteins. The implementation of high-throughput sequencing has significantly augmented our comprehension of the intricate interplay between gene variations and protein structure and function, as well as their ramifications in the context of diseases. Frameshift variations, particularly small insertions and deletions (indels), disrupt protein coding and are instrumental in disease pathogenesis. This review presents a succinct review of computational methods, databases, current challenges, and future directions in predicting the consequences of coding frameshift small indels variations. We analyzed the predictive efficacy, reliability, and utilization of computational methods and variant account, reliability, and utilization of database. Besides, we also compared the prediction methodologies on GOF/LOF pathogenic variation data. Addressing the challenges pertaining to prediction accuracy and cross-species generalizability, nascent technologies such as AI and deep learning harbor immense potential to enhance predictive capabilities. The importance of interdisciplinary research and collaboration cannot be overstated for devising effective diagnosis, treatment, and prevention strategies concerning diseases associated with coding frameshift indels variations.

Citing Articles

A next-generation sequencing-based universal target panel and algorithm for one-stop detection of copy number alterations and single-nucleotide variations in the HBB gene cluster for rapid diagnosis of β-thalassemia.

Pal D, Chowdhury P, Nayek K, Biswas N, Das S, Basu A Mol Biol Rep. 2025; 52(1):128.

PMID: 39820710 DOI: 10.1007/s11033-024-10196-2.


MetaCGRP is a high-precision meta-model for large-scale identification of CGRP inhibitors using multi-view information.

Schaduangrat N, Khemawoot P, Jiso A, Charoenkwan P, Shoombuatong W Sci Rep. 2024; 14(1):24764.

PMID: 39433940 PMC: 11494111. DOI: 10.1038/s41598-024-75487-x.


PLMACPred prediction of anticancer peptides based on protein language model and wavelet denoising transformation.

Arif M, Musleh S, Fida H, Alam T Sci Rep. 2024; 14(1):16992.

PMID: 39043738 PMC: 11266708. DOI: 10.1038/s41598-024-67433-8.

References
1.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View

2.
Bennett E, Keller H, Mills R, Schmidt S, Moran J, Weichenrieder O . Active Alu retrotransposons in the human genome. Genome Res. 2008; 18(12):1875-83. PMC: 2593586. DOI: 10.1101/gr.081737.108. View

3.
Li Z, Li X, Zhou H, Gaynor S, Selvaraj M, Arapoglou T . A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies. Nat Methods. 2022; 19(12):1599-1611. PMC: 10008172. DOI: 10.1038/s41592-022-01640-x. View

4.
Xiang X, Zhao X, Pan X, Dong Z, Yu J, Li S . Efficient correction of Duchenne muscular dystrophy mutations by SpCas9 and dual gRNAs. Mol Ther Nucleic Acids. 2021; 24:403-415. PMC: 8039775. DOI: 10.1016/j.omtn.2021.03.005. View

5.
Firth H, Richards S, Bevan A, Clayton S, Corpas M, Rajan D . DECIPHER: Database of Chromosomal Imbalance and Phenotype in Humans Using Ensembl Resources. Am J Hum Genet. 2009; 84(4):524-33. PMC: 2667985. DOI: 10.1016/j.ajhg.2009.03.010. View