» Articles » PMID: 28287462

SNP Discovery Using a Pangenome: Has the Single Reference Approach Become Obsolete?

Overview
Journal Biology (Basel)
Publisher MDPI
Specialty Biology
Date 2017 Mar 14
PMID 28287462
Citations 33
Authors
Affiliations
Soon will be listed here.
Abstract

Increasing evidence suggests that a single individual is insufficient to capture the genetic diversity within a species due to gene presence absence variation. In order to understand the extent to which genomic variation occurs in a species, the construction of its pangenome is necessary. The pangenome represents the complete set of genes of a species; it is composed of core genes, which are present in all individuals, and variable genes, which are present only in some individuals. Aside from variations at the gene level, single nucleotide polymorphisms (SNPs) are also an important form of genetic variation. The advent of next-generation sequencing (NGS) coupled with the heritability of SNPs make them ideal markers for genetic analysis of human, animal, and microbial data. SNPs have also been extensively used in crop genetics for association mapping, quantitative trait loci (QTL) analysis, analysis of genetic diversity, and phylogenetic analysis. This review focuses on the use of pangenomes for SNP discovery. It highlights the advantages of using a pangenome rather than a single reference for this purpose. This review also demonstrates how extra information not captured in a single reference alone can be used to provide additional support for linking genotypic data to phenotypic data.

Citing Articles

Comparison of Recombination Rate, Reference Bias, and Unique Pangenomic Haplotypes in Using Seven De Novo Genome Assemblies.

Stack G, Quade M, Wilkerson D, Monserrate L, Bentz P, Carey S Int J Mol Sci. 2025; 26(3).

PMID: 39940933 PMC: 11818205. DOI: 10.3390/ijms26031165.


The role of pangenomics in orphan crop improvement.

Hu H, Zhao J, Thomas W, Batley J, Edwards D Nat Commun. 2025; 16(1):118.

PMID: 39746989 PMC: 11696220. DOI: 10.1038/s41467-024-55260-4.


The developments and prospects of plant super-pangenomes: Demands, approaches, and applications.

He W, Li X, Qian Q, Shang L Plant Commun. 2024; 6(2):101230.

PMID: 39722458 PMC: 11897476. DOI: 10.1016/j.xplc.2024.101230.


Seamless, rapid, and accurate analyses of outbreak genomic data using split -mer analysis.

Derelle R, von Wachsmann J, Maklin T, Hellewell J, Russell T, Lalvani A Genome Res. 2024; 34(10):1661-1673.

PMID: 39406504 PMC: 11529842. DOI: 10.1101/gr.279449.124.


Genomic and cell-specific regulation of benzylisoquinoline alkaloid biosynthesis in opium poppy.

Hong U, Tamiru-Oli M, Hurgobin B, Lewsey M J Exp Bot. 2024; 76(1):35-51.

PMID: 39046316 PMC: 11659185. DOI: 10.1093/jxb/erae317.


References
1.
Tajima F . Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989; 123(3):585-95. PMC: 1203831. DOI: 10.1093/genetics/123.3.585. View

2.
Saxena R, Edwards D, Varshney R . Structural variations in plant genomes. Brief Funct Genomics. 2014; 13(4):296-307. PMC: 4110416. DOI: 10.1093/bfgp/elu016. View

3.
Cao M, Nguyen S, Ganesamoorthy D, Elliott A, Cooper M, Coin L . Scaffolding and completing genome assemblies in real-time with nanopore sequencing. Nat Commun. 2017; 8:14515. PMC: 5321748. DOI: 10.1038/ncomms14515. View

4.
Li R, Yu C, Li Y, Lam T, Yiu S, Kristiansen K . SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009; 25(15):1966-7. DOI: 10.1093/bioinformatics/btp336. View

5.
Li W, Wu C, Luo C . A new method for estimating synonymous and nonsynonymous rates of nucleotide substitution considering the relative likelihood of nucleotide and codon changes. Mol Biol Evol. 1985; 2(2):150-74. DOI: 10.1093/oxfordjournals.molbev.a040343. View