» Articles » PMID: 35585280

EasyMF: A Web Platform for Matrix Factorization-Based Gene Discovery from Large-scale Transcriptome Data

Overview
Journal Interdiscip Sci
Specialty Biology
Date 2022 May 18
PMID 35585280
Authors
Affiliations
Soon will be listed here.
Abstract

With the development of high-throughput experimental technologies, large-scale RNA sequencing (RNA-Seq) data have been and continue to be produced, but have led to challenges in extracting relevant biological knowledge hidden in the produced high-dimensional gene expression matrices. Here, we develop easyMF ( https://github.com/cma2015/easyMF ), a web platform that can facilitate functional gene discovery from large-scale transcriptome data using matrix factorization (MF) algorithms. Compared with existing MF-based software packages, easyMF exhibits several promising features, such as greater functionality, flexibility and ease of use. The easyMF platform is equipped using the Big-Data-supported Galaxy system with user-friendly graphic user interfaces, allowing users with little programming experience to streamline transcriptome analysis from raw reads to gene expression, carry out multiple-scenario MF analysis, and perform multiple-way MF-based gene discovery. easyMF is also powered with the advanced packing technology to enhance ease of use under different operating systems and computational environments. We illustrated the application of easyMF for seed gene discovery from temporal, spatial, and integrated RNA-Seq datasets of maize (Zea mays L.), resulting in the identification of 3,167 seed stage-specific, 1,849 seed compartment-specific, and 774 seed-specific genes, respectively. The present results also indicated that easyMF can prioritize seed-related genes with superior prediction performance over the state-of-art network-based gene prioritization system MaizeNet. As a modular, containerized and open-source platform, easyMF can be further customized to satisfy users' specific demands of functional gene discovery and deployed as a web service for broad applications.

Citing Articles

A review of artificial intelligence-assisted omics techniques in plant defense: current trends and future directions.

Murmu S, Sinha D, Chaurasia H, Sharma S, Das R, Jha G Front Plant Sci. 2024; 15:1292054.

PMID: 38504888 PMC: 10948452. DOI: 10.3389/fpls.2024.1292054.


HetFCM: functional co-module discovery by heterogeneous network co-clustering.

Tan H, Guo M, Chen J, Wang J, Yu G Nucleic Acids Res. 2023; 52(3):e16.

PMID: 38088228 PMC: 10853805. DOI: 10.1093/nar/gkad1174.

References
1.
Nelms B, Walbot V . Defining the developmental program leading to meiosis in maize. Science. 2019; 364(6435):52-56. DOI: 10.1126/science.aav6428. View

2.
Cardoso-Moreira M, Halbert J, Valloton D, Velten B, Chen C, Shao Y . Gene expression across mammalian organ development. Nature. 2019; 571(7766):505-509. PMC: 6658352. DOI: 10.1038/s41586-019-1338-5. View

3.
Sarropoulos I, Marin R, Cardoso-Moreira M, Kaessmann H . Developmental dynamics of lncRNAs across mammalian organs and species. Nature. 2019; 571(7766):510-514. PMC: 6660317. DOI: 10.1038/s41586-019-1341-x. View

4.
Qiu Z, Chen S, Qi Y, Liu C, Zhai J, Xie S . Exploring transcriptional switches from pairwise, temporal and population RNA-Seq data using deepTS. Brief Bioinform. 2020; 22(3). DOI: 10.1093/bib/bbaa137. View

5.
Hyvarinen A, Oja E . Independent component analysis: algorithms and applications. Neural Netw. 2000; 13(4-5):411-30. DOI: 10.1016/s0893-6080(00)00026-5. View