» Articles » PMID: 33773584

Boosting Heritability: Estimating the Genetic Component of Phenotypic Variation with Multiple Sample Splitting

Overview
Publisher Biomed Central
Specialty Biology
Date 2021 Mar 28
PMID 33773584
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Heritability is a central measure in genetics quantifying how much of the variability observed in a trait is attributable to genetic differences. Existing methods for estimating heritability are most often based on random-effect models, typically for computational reasons. The alternative of using a fixed-effect model has received much more limited attention in the literature.

Results: In this paper, we propose a generic strategy for heritability inference, termed as "boosting heritability", by combining the advantageous features of different recent methods to produce an estimate of the heritability with a high-dimensional linear model. Boosting heritability uses in particular a multiple sample splitting strategy which leads in general to a stable and accurate estimate. We use both simulated data and real antibiotic resistance data from a major human pathogen, Sptreptococcus pneumoniae, to demonstrate the attractive features of our inference strategy.

Conclusions: Boosting is shown to offer a reliable and practically useful tool for inference about heritability.

Citing Articles

Optimising machine learning prediction of minimum inhibitory concentrations in .

Batisti Biffignandi G, Chindelevitch L, Corbella M, Feil E, Sassera D, Lees J Microb Genom. 2024; 10(3).

PMID: 38529944 PMC: 10995625. DOI: 10.1099/mgen.0.001222.


Insights into the influence of diet and genetics on feed efficiency and meat production in sheep.

Chacko Kaitholil S, Mooney M, Aubry A, Rezwan F, Shirali M Anim Genet. 2023; 55(1):20-46.

PMID: 38112204 PMC: 10952161. DOI: 10.1111/age.13383.


Inferring the heritability of bacterial traits in the era of machine learning.

Mai T, Lees J, Gladstone R, Corander J Bioinform Adv. 2023; 3(1):vbad027.

PMID: 36974068 PMC: 10039732. DOI: 10.1093/bioadv/vbad027.


Genome-wide association, prediction and heritability in bacteria with application to .

Mallawaarachchi S, Tonkin-Hill G, Croucher N, Turner P, Speed D, Corander J NAR Genom Bioinform. 2022; 4(1):lqac011.

PMID: 35211669 PMC: 8862724. DOI: 10.1093/nargab/lqac011.

References
1.
Janson L, Barber R, Candes E . EigenPrism: inference for high dimensional signal-to-noise ratios. J R Stat Soc Series B Stat Methodol. 2017; 79(4):1037-1065. PMC: 5663223. DOI: 10.1111/rssb.12203. View

2.
Buzdugan L, Kalisch M, Navarro A, Schunk D, Fehr E, Buhlmann P . Assessing statistical significance in multivariable genome wide association analysis. Bioinformatics. 2016; 32(13):1990-2000. PMC: 4920127. DOI: 10.1093/bioinformatics/btw128. View

3.
Bulik-Sullivan B, Loh P, Finucane H, Ripke S, Yang J, Patterson N . LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet. 2015; 47(3):291-5. PMC: 4495769. DOI: 10.1038/ng.3211. View

4.
Zhou X . A UNIFIED FRAMEWORK FOR VARIANCE COMPONENT ESTIMATION WITH SUMMARY STATISTICS IN GENOME-WIDE ASSOCIATION STUDIES. Ann Appl Stat. 2018; 11(4):2027-2051. PMC: 5836736. DOI: 10.1214/17-AOAS1052. View

5.
Weissbrod O, Flint J, Rosset S . Estimating SNP-Based Heritability and Genetic Correlation in Case-Control Studies Directly and with Summary Statistics. Am J Hum Genet. 2018; 103(1):89-99. PMC: 6035374. DOI: 10.1016/j.ajhg.2018.06.002. View