» Articles » PMID: 39964984

NAVIP: Unraveling the Influence of Neighboring Small Sequence Variants on Functional Impact Prediction

Overview
Specialty Biology
Date 2025 Feb 18
PMID 39964984
Authors
Affiliations
Soon will be listed here.
Abstract

Once a suitable reference sequence has been generated, intra-species variation is often assessed by re-sequencing. Variant calling processes can reveal all differences between strains, accessions, genotypes, or individuals. These variants can be enriched with predictions about their functional implications based on available structural annotations, i.e., gene models. Although these functional impact predictions on a per-variant basis are often accurate, some challenging cases require the simultaneous incorporation of multiple adjacent variants into this prediction process. Examples include neighboring variants which modify each other's functional impact. The Neighborhood-Aware Variant Impact Predictor (NAVIP) considers all variants within a given protein coding sequence when predicting the effect. As a proof of concept, variants between the Arabidopsis thaliana accessions Columbia-0 and Niederzenz-1 were annotated. NAVIP is freely available on GitHub (https://github.com/bpucker/NAVIP) and accessible through a web server (https://pbb-tools.de).

References
1.
Choudhary N, Pucker B . Conserved amino acid residues and gene expression patterns associated with the substrate preferences of the competing enzymes FLS and DFR. PLoS One. 2024; 19(8):e0305837. PMC: 11356453. DOI: 10.1371/journal.pone.0305837. View

2.
Katsonis P, Wilhelm K, Williams A, Lichtarge O . Genome interpretation using in silico predictors of variant impact. Hum Genet. 2022; 141(10):1549-1577. PMC: 9055222. DOI: 10.1007/s00439-022-02457-6. View

3.
Rosso M, Li Y, Strizhov N, Reiss B, Dekker K, Weisshaar B . An Arabidopsis thaliana T-DNA mutagenized population (GABI-Kat) for flanking sequence tag-based reverse genetics. Plant Mol Biol. 2004; 53(1-2):247-59. DOI: 10.1023/B:PLAN.0000009297.37235.4a. View

4.
Ahsan M, Liu Q, Fang L, Wang K . NanoCaller for accurate detection of SNPs and indels in difficult-to-map regions from long-read sequencing by haplotype-aware deep neural networks. Genome Biol. 2021; 22(1):261. PMC: 8419925. DOI: 10.1186/s13059-021-02472-2. View

5.
Stein L . The case for cloud computing in genome informatics. Genome Biol. 2010; 11(5):207. PMC: 2898083. DOI: 10.1186/gb-2010-11-5-207. View