» Articles » PMID: 26464167

Compression Distance Can Discriminate Animals by Genetic Profile, Build Relationship Matrices and Estimate Breeding Values

Overview
Journal Genet Sel Evol
Publisher Biomed Central
Specialties Biology
Genetics
Date 2015 Oct 15
PMID 26464167
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Genetic relatedness is currently estimated by a combination of traditional pedigree-based approaches (i.e. numerator relationship matrices, NRM) and, given the recent availability of molecular information, using marker genotypes (via genomic relationship matrices, GRM). To date, GRM are computed by genome-wide pair-wise SNP (single nucleotide polymorphism) correlations.

Results: We describe a new estimate of genetic relatedness using the concept of normalised compression distance (NCD) that is borrowed from Information Theory. Analogous to GRM, the resultant compression relationship matrix (CRM) exploits numerical patterns in genome-wide allele order and proportion, which are known to vary systematically with relatedness. We explored properties of the CRM in two industry cattle datasets by analysing the genetic basis of yearling weight, a phenotype of moderate heritability. In both Brahman (Bos indicus) and Tropical Composite (Bos taurus by Bos indicus) populations, the clustering inferred by NCD was comparable to that based on SNP correlations using standard principal component analysis approaches. One of the versions of the CRM modestly increased the amount of explained genetic variance, slightly reduced the 'missing heritability' and tended to improve the prediction accuracy of breeding values in both populations when compared to both NRM and GRM. Finally, a sliding window-based application of the compression approach on these populations identified genomic regions influenced by introgression of taurine haplotypes.

Conclusions: For these two bovine populations, CRM reduced the missing heritability and increased the amount of explained genetic variation for a moderately heritable complex trait. Given that NCD can sensitively discriminate closely related individuals, we foresee CRM having possible value for estimating breeding values in highly inbred populations.

Citing Articles

RAPID COMMUNICATION: A haplotype information theory method reveals genes of evolutionary interest in European vs. Asian pigs.

Hudson N, Naval-Sanchez M, Porto-Neto L, Perez-Enciso M, Reverter A J Anim Sci. 2018; 96(8):3064-3069.

PMID: 29873754 PMC: 6095408. DOI: 10.1093/jas/sky225.


The Bos taurus-Bos indicus balance in fertility and milk related genes.

Kasarapu P, Porto-Neto L, Fortes M, Lehnert S, Mudadu M, Coutinho L PLoS One. 2017; 12(8):e0181930.

PMID: 28763475 PMC: 5538644. DOI: 10.1371/journal.pone.0181930.

References
1.
Lachance J, Tishkoff S . SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it. Bioessays. 2013; 35(9):780-6. PMC: 3849385. DOI: 10.1002/bies.201300014. View

2.
VanRaden P . Efficient methods to compute genomic predictions. J Dairy Sci. 2008; 91(11):4414-23. DOI: 10.3168/jds.2007-0980. View

3.
Xu L, Bickhart D, Cole J, Schroeder S, Song J, Van Tassell C . Genomic signatures reveal new evidences for selection of important traits in domestic cattle. Mol Biol Evol. 2014; 32(3):711-25. PMC: 4441790. DOI: 10.1093/molbev/msu333. View

4.
Zhang Q, Lee H, Han J, Kim E, Kang S, Yin J . Differentially expressed proteins during fat accumulation in bovine skeletal muscle. Meat Sci. 2010; 86(3):814-20. DOI: 10.1016/j.meatsci.2010.07.002. View

5.
de Camargo G, Costa R, de Albuquerque L, Regitano L, Baldi F, Tonhati H . Polymorphisms in TOX and NCOA2 genes and their associations with reproductive traits in cattle. Reprod Fertil Dev. 2014; 27(3):523-8. DOI: 10.1071/RD13360. View