Gene Expression Data Analysis
Overview
Microarrays are one of the latest breakthroughs in experimental molecular biology. They allow gene expression to be monitored for tens of thousands of genes in parallel and are already producing huge amounts of valuable data. Analysis and handling of such data is becoming one of the major bottlenecks in the use of the technology. The raw microarray data are images, which have to be transformed into gene expression matrices: tables where rows represent genes, columns represent samples such as tissues or experimental conditions, and the number in each cell characterizes the expression level of a particular gene in a particular sample. These matrices have to be analyzed further if any knowledge about the underlying biological processes is to be extracted. In this paper we concentrate on the bioinformatics methods used for such analysis. We briefly discuss supervised and unsupervised data analysis and their applications, such as predicting gene function classes and cancer classification. We then discuss how the gene expression matrix can be used to predict putative regulatory signals in genome sequences. In conclusion, we discuss some possible future directions.
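To make the gene expression matrix concrete, here is a minimal sketch of one simple unsupervised step: comparing genes by the similarity of their expression profiles across samples. The gene names, sample names, and values are hypothetical toy data, and the use of Pearson correlation as the similarity measure is an assumption made for illustration, not a method taken from the paper.

```python
import numpy as np

# Hypothetical toy matrix: rows are genes, columns are samples.
genes = ["geneA", "geneB", "geneC", "geneD"]
samples = ["s1", "s2", "s3", "s4", "s5"]
expression = np.array([
    [1.0, 2.0, 3.0, 4.0, 5.0],   # geneA: rising across samples
    [2.1, 4.0, 6.2, 8.1, 9.9],   # geneB: rising, similar to geneA
    [5.0, 4.0, 3.0, 2.0, 1.0],   # geneC: falling across samples
    [9.8, 8.1, 6.0, 4.2, 2.0],   # geneD: falling, similar to geneC
])


def correlation_matrix(matrix):
    """Pearson correlation between every pair of rows (gene profiles)."""
    return np.corrcoef(matrix)


def most_similar_gene(matrix, names, query):
    """Return the gene whose profile correlates best with `query`."""
    corr = correlation_matrix(matrix)
    i = names.index(query)
    scores = corr[i].copy()
    scores[i] = -np.inf  # exclude the query gene itself
    return names[int(np.argmax(scores))]


if __name__ == "__main__":
    for gene in genes:
        print(gene, "is most similar to", most_similar_gene(expression, genes, gene))
```

Genes with similar profiles can then be grouped, for example by feeding the correlation matrix into a clustering algorithm. Real data would also need normalization and handling of missing values before this step.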