» Articles » PMID: 38187579

Matrix Linear Models for Connecting Metabolite Composition to Individual Characteristics

Overview
Journal bioRxiv
Date 2024 Jan 8
PMID 38187579
Authors
Affiliations
Soon will be listed here.
Abstract

High-throughput metabolomics data provide a detailed molecular window into biological processes. We consider the problem of assessing how the association of metabolite levels with individual (sample) characteristics such as sex or treatment may depend on metabolite characteristics such as pathway. Typically this is one in a two-step process: In the first step we assess the association of each metabolite with individual characteristics. In the second step an enrichment analysis is performed by metabolite characteristics among significant associations. We combine the two steps using a bilinear model based on the matrix linear model (MLM) framework we have previously developed for high-throughput genetic screens. Our framework can estimate relationships in metabolites sharing known characteristics, whether categorical (such as type of lipid or pathway) or numerical (such as number of double bonds in triglycerides). We demonstrate how MLM offers flexibility and interpretability by applying our method to three metabolomic studies. We show that our approach can separate the contribution of the overlapping triglycerides characteristics, such as the number of double bonds and the number of carbon atoms. The proposed method have been implemented in the open-source Julia package, MatrixLM. Data analysis scripts with example data analyses are also available.

References
1.
Dieterle F, Ross A, Schlotterbeck G, Senn H . Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures. Application in 1H NMR metabonomics. Anal Chem. 2006; 78(13):4281-90. DOI: 10.1021/ac051632c. View

2.
Leek J, Johnson W, Parker H, Jaffe A, Storey J . The sva package for removing batch effects and other unwanted variation in high-throughput experiments. Bioinformatics. 2012; 28(6):882-3. PMC: 3307112. DOI: 10.1093/bioinformatics/bts034. View

3.
Gillenwater L, Kechris K, Pratte K, Reisdorph N, Petrache I, Labaki W . Metabolomic Profiling Reveals Sex Specific Associations with Chronic Obstructive Pulmonary Disease and Emphysema. Metabolites. 2021; 11(3). PMC: 7999201. DOI: 10.3390/metabo11030161. View

4.
Koelmel J, Ulmer C, Fogelson S, Jones C, Botha H, Bangma J . Lipidomics for wildlife disease etiology and biomarker discovery: a case study of pansteatitis outbreak in South Africa. Metabolomics. 2019; 15(3):38. PMC: 11005104. DOI: 10.1007/s11306-019-1490-9. View

5.
Nyamundanda G, Brennan L, Gormley I . Probabilistic principal component analysis for metabolomic data. BMC Bioinformatics. 2010; 11:571. PMC: 3006395. DOI: 10.1186/1471-2105-11-571. View