» Articles » PMID: 29910825

Matrix Integrative Analysis (MIA) of Multiple Genomic Data for Modular Patterns

Overview
Journal Front Genet
Date 2018 Jun 19
PMID 29910825
Authors
Affiliations
Soon will be listed here.
Abstract

The increasing availability of high-throughput biological data, especially multi-dimensional genomic data across the same samples, has created an urgent need for modular and integrative analysis tools that can reveal the relationships among different layers of cellular activities. To this end, we present a MATLAB package, Matrix Integration Analysis (MIA), implementing and extending four published methods, designed based on two classical techniques, non-negative matrix factorization (NMF), and partial least squares (PLS). This package can integrate diverse types of genomic data (e.g., copy number variation, DNA methylation, gene expression, microRNA expression profiles, and/or gene network data) to identify the underlying modular patterns by each method. Particularly, we demonstrate the differences between these two classes of methods, which give users some suggestions about how to select a suitable method in the MIA package. MIA is a flexible tool which could handle a wide range of biological problems and data types. Besides, we also provide an executable version for users without a MATLAB license.

References
1.
Nguyen D, Rocke D . Multi-class cancer classification via partial least squares with gene expression profiles. Bioinformatics. 2002; 18(9):1216-26. DOI: 10.1093/bioinformatics/18.9.1216. View

2.
Li Y, Ngom A . The non-negative matrix factorization toolbox for biological data mining. Source Code Biol Med. 2013; 8(1):10. PMC: 3736608. DOI: 10.1186/1751-0473-8-10. View

3.
Kasar S, Kim J, Improgo R, Tiao G, Polak P, Haradhvala N . Whole-genome sequencing reveals activation-induced cytidine deaminase signatures during indolent chronic lymphocytic leukaemia evolution. Nat Commun. 2015; 6:8866. PMC: 4686820. DOI: 10.1038/ncomms9866. View

4.
Le Cao K, Boitard S, Besse P . Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems. BMC Bioinformatics. 2011; 12:253. PMC: 3133555. DOI: 10.1186/1471-2105-12-253. View

5.
Boulesteix A, Strimmer K . Predicting transcription factor activities from combined analysis of microarray and ChIP data: a partial least squares approach. Theor Biol Med Model. 2005; 2:23. PMC: 1182396. DOI: 10.1186/1742-4682-2-23. View