» Articles » PMID: 10967323

Gene Expression Data Analysis

Overview
Journal FEBS Lett
Specialty Biochemistry
Date 2000 Sep 1
PMID 10967323
Citations 80
Authors
Affiliations
Soon will be listed here.
Abstract

Microarrays are one of the latest breakthroughs in experimental molecular biology, which allow monitoring of gene expression for tens of thousands of genes in parallel and are already producing huge amounts of valuable data. Analysis and handling of such data is becoming one of the major bottlenecks in the utilization of the technology. The raw microarray data are images, which have to be transformed into gene expression matrices--tables where rows represent genes, columns represent various samples such as tissues or experimental conditions, and numbers in each cell characterize the expression level of the particular gene in the particular sample. These matrices have to be analyzed further, if any knowledge about the underlying biological processes is to be extracted. In this paper we concentrate on discussing bioinformatics methods used for such analysis. We briefly discuss supervised and unsupervised data analysis and its applications, such as predicting gene function classes and cancer classification. Then we discuss how the gene expression matrix can be used to predict putative regulatory signals in the genome sequences. In conclusion we discuss some possible future directions.

Citing Articles

Identification of crosstalk genes and immune characteristics between Alzheimer's disease and atherosclerosis.

An W, Zhou J, Qiu Z, Wang P, Han X, Cheng Y Front Immunol. 2024; 15:1443464.

PMID: 39188714 PMC: 11345154. DOI: 10.3389/fimmu.2024.1443464.


Reference genes for Eucalyptus spp. under Beauveria bassiana inoculation and subsequently infestation by the galling wasp Leptocybe invasa.

Daude M, Sagio S, Rodrigues J, Lima N, Lima A, Sarmento M Sci Rep. 2024; 14(1):2556.

PMID: 38297150 PMC: 10830493. DOI: 10.1038/s41598-024-52948-x.


Improved Regularized Multi-class Logistic Regression for Gene Classification with Optimal Kernel PCA and HC Algorithm.

Mohammed N Adv Exp Med Biol. 2023; 1424:273-279.

PMID: 37486504 DOI: 10.1007/978-3-031-31982-2_31.


ForestSubtype: a cancer subtype identifying approach based on high-dimensional genomic data and a parallel random forest.

Luo J, Feng Y, Wu X, Li R, Shi J, Chang W BMC Bioinformatics. 2023; 24(1):289.

PMID: 37468832 PMC: 10354904. DOI: 10.1186/s12859-023-05412-y.


Applications of transformer-based language models in bioinformatics: a survey.

Zhang S, Fan R, Liu Y, Chen S, Liu Q, Zeng W Bioinform Adv. 2023; 3(1):vbad001.

PMID: 36845200 PMC: 9950855. DOI: 10.1093/bioadv/vbad001.