» Articles » PMID: 31722668

High Throughput Genotyping of Structural Variations in a Complex Plant Genome Using an Original Affymetrix® Axiom® Array

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2019 Nov 15
PMID 31722668
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Insertions/deletions (InDels) and more specifically presence/absence variations (PAVs) are pervasive in several species and have strong functional and phenotypic effect by removing or drastically modifying genes. Genotyping of such variants on large panels remains poorly addressed, while necessary for approaches such as association mapping or genomic selection.

Results: We have developed, as a proof of concept, a new high-throughput and affordable approach to genotype InDels. We first identified 141,000 InDels by aligning reads from the B73 line against the genome of three temperate maize inbred lines (F2, PH207, and C103) and reciprocally. Next, we designed an Affymetrix® Axiom® array to target these InDels, with a combination of probes selected at breakpoint sites (13%) or within the InDel sequence, either at polymorphic (25%) or non-polymorphic sites (63%) sites. The final array design is composed of 662,772 probes and targets 105,927 InDels, including PAVs ranging from 35 bp to 129kbp. After Affymetrix® quality control, we successfully genotyped 86,648 polymorphic InDels (82% of all InDels interrogated by the array) on 445 maize DNA samples with 422,369 probes. Genotyping InDels using this approach produced a highly reliable dataset, with low genotyping error (~ 3%), high call rate (~ 98%), and high reproducibility (> 95%). This reliability can be further increased by combining genotyping of several probes calling the same InDels (< 0.1% error rate and > 99.9% of call rate for 5 probes). This "proof of concept" tool was used to estimate the kinship matrix between 362 maize lines with 57,824 polymorphic InDels. This InDels kinship matrix was highly correlated with kinship estimated using SNPs from Illumina 50 K SNP arrays.

Conclusions: We efficiently genotyped thousands of small to large InDels on a sizeable number of individuals using a new Affymetrix® Axiom® array. This powerful approach opens the way to studying the contribution of InDels to trait variation and heterosis in maize. The approach is easily extendable to other species and should contribute to decipher the biological impact of InDels at a larger scale.

Citing Articles

Non-additive expression genes play a critical role in leaf vein ratio heterosis in Nicotiana tabacum L.

Duan L, Mo Z, Li K, Pi K, Luo J, Que Y BMC Genomics. 2024; 25(1):924.

PMID: 39363277 PMC: 11451143. DOI: 10.1186/s12864-024-10821-1.


Genetic variability of aquaporin expression in maize: From eQTLs to a MITE insertion regulating PIP2;5 expression.

Maistriaux L, Laurent M, Jeanguenin L, Prado S, Nader J, Welcker C Plant Physiol. 2024; 196(1):368-384.

PMID: 38839061 PMC: 11376376. DOI: 10.1093/plphys/kiae326.


Oxford Nanopore and Bionano Genomics technologies evaluation for plant structural variation detection.

Canaguier A, Guilbaud R, Denis E, Magdelenat G, Belser C, Istace B BMC Genomics. 2022; 23(1):317.

PMID: 35448948 PMC: 9026655. DOI: 10.1186/s12864-022-08499-4.


Increasing calling accuracy, coverage, and read-depth in sequence data by the use of haplotype blocks.

Pook T, Nemri A, Gonzalez Segovia E, Valle Torres D, Simianer H, Schoen C PLoS Genet. 2021; 17(12):e1009944.

PMID: 34941872 PMC: 8699914. DOI: 10.1371/journal.pgen.1009944.


A cost-effective barcode system for maize genetic discrimination based on bi-allelic InDel markers.

Liang S, Lin F, Qian Y, Zhang T, Wu Y, Qi Y Plant Methods. 2020; 16:101.

PMID: 32742299 PMC: 7391534. DOI: 10.1186/s13007-020-00644-y.

References
1.
Belo A, Zheng P, Luck S, Shen B, Meyer D, Li B . Whole genome scan detects an allelic variant of fad2 associated with increased oleic acid levels in maize. Mol Genet Genomics. 2007; 279(1):1-10. DOI: 10.1007/s00438-007-0289-y. View

2.
Morgante M, De Paoli E, Radovic S . Transposable elements and the plant pan-genomes. Curr Opin Plant Biol. 2007; 10(2):149-55. DOI: 10.1016/j.pbi.2007.02.001. View

3.
Chen K, Wallis J, McLellan M, Larson D, Kalicki J, Pohl C . BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods. 2009; 6(9):677-81. PMC: 3661775. DOI: 10.1038/nmeth.1363. View

4.
Hupe P, Stransky N, Thiery J, Radvanyi F, Barillot E . Analysis of array CGH data: from signal ratio to gain and loss of DNA regions. Bioinformatics. 2004; 20(18):3413-22. DOI: 10.1093/bioinformatics/bth418. View

5.
Wang X, Lebarbier E, Aubert J, Robin S . Variational Inference for Coupled Hidden Markov Models Applied to the Joint Detection of Copy Number Variations. Int J Biostat. 2019; 15(1). DOI: 10.1515/ijb-2018-0023. View