» Articles » PMID: 26654230

Construction of Relatedness Matrices Using Genotyping-by-sequencing Data

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2015 Dec 15
PMID 26654230
Citations 59
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Genotyping-by-sequencing (GBS) is becoming an attractive alternative to array-based methods for genotyping individuals for a large number of single nucleotide polymorphisms (SNPs). Costs can be lowered by reducing the mean sequencing depth, but this results in genotype calls of lower quality. A common analysis strategy is to filter SNPs to just those with sufficient depth, thereby greatly reducing the number of SNPs available. We investigate methods for estimating relatedness using GBS data, including results of low depth, using theoretical calculation, simulation and application to a real data set.

Results: We show that unbiased estimates of relatedness can be obtained by using only those SNPs with genotype calls in both individuals. The expected value of this estimator is independent of the SNP depth in each individual, under a model of genotype calling that includes the special case of the two alleles being read at random. In contrast, the estimator of self-relatedness does depend on the SNP depth, and we provide a modification to provide unbiased estimates of self-relatedness. We refer to these methods of estimation as kinship using GBS with depth adjustment (KGD). The estimators can be calculated using matrix methods, which allow efficient computation. Simulation results were consistent with the methods being unbiased, and suggest that the optimal sequencing depth is around 2-4 for relatedness between individuals and 5-10 for self-relatedness. Application to a real data set revealed that some SNP filtering may still be necessary, for the exclusion of SNPs which did not behave in a Mendelian fashion. A simple graphical method (a 'fin plot') is given to illustrate this issue and to guide filtering parameters.

Conclusion: We provide a method which gives unbiased estimates of relatedness, based on SNPs assayed by GBS, which accounts for the depth (including zero depth) of the genotype calls. This allows GBS to be applied at read depths which can be chosen to optimise the information obtained. SNPs with excess heterozygosity, often due to (partial) polyploidy or other duplications can be filtered based on a simple graphical method.

Citing Articles

Single-cell eQTL mapping in yeast reveals a tradeoff between growth and reproduction.

Boocock J, Alexander N, Alamo Tapia L, Walter-McNeill L, Patel S, Munugala C Elife. 2025; 13.

PMID: 40073070 PMC: 11903034. DOI: 10.7554/eLife.95566.


Genomic selection shows improved expected genetic gain over phenotypic selection of agronomic traits in allotetraploid white clover.

Ehoche O, Arojju S, Jahufer M, Jauregui R, Larking A, Cousins G Theor Appl Genet. 2025; 138(1):34.

PMID: 39847157 PMC: 11757872. DOI: 10.1007/s00122-025-04819-w.


Genetic diversity and background pollen contamination in Norway spruce and Scots pine seed orchard crops.

Heuchel A, Hall D, Zhao W, Gao J, Wennstrom U, Wang X For Res (Fayettev). 2024; 2:8.

PMID: 39525423 PMC: 11524256. DOI: 10.48130/FR-2022-0008.


Genetic parameters and genotype-by-environment interaction estimates for growth and feed efficiency related traits in Chinook salmon, Oncorhynchus tshawytscha, reared under low and moderate flow regimes.

Prescott L, Scholtens M, Walker S, Clarke S, Dodds K, Miller M Genet Sel Evol. 2024; 56(1):63.

PMID: 39266967 PMC: 11396914. DOI: 10.1186/s12711-024-00929-z.


Coat colour in marsupials: genetic variants at the locus determine grey and black fur of the brushtail possum.

Bond D, Veale A, Alexander A, Hore T R Soc Open Sci. 2024; 11(7):240806.

PMID: 39086822 PMC: 11288674. DOI: 10.1098/rsos.240806.


References
1.
Elshire R, Glaubitz J, Sun Q, Poland J, Kawamoto K, Buckler E . A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011; 6(5):e19379. PMC: 3087801. DOI: 10.1371/journal.pone.0019379. View

2.
Deschamps S, Llaca V, May G . Genotyping-by-Sequencing in Plants. Biology (Basel). 2014; 1(3):460-83. PMC: 4009820. DOI: 10.3390/biology1030460. View

3.
Lu F, Lipka A, Glaubitz J, Elshire R, Cherney J, Casler M . Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol. PLoS Genet. 2013; 9(1):e1003215. PMC: 3547862. DOI: 10.1371/journal.pgen.1003215. View

4.
Gamal El-Dien O, Ratcliffe B, Klapste J, Chen C, Porth I, El-Kassaby Y . Prediction accuracies for growth and wood attributes of interior spruce in space using genotyping-by-sequencing. BMC Genomics. 2015; 16:370. PMC: 4424896. DOI: 10.1186/s12864-015-1597-y. View

5.
VanRaden P . Efficient methods to compute genomic predictions. J Dairy Sci. 2008; 91(11):4414-23. DOI: 10.3168/jds.2007-0980. View