
Allegro: Analyzing Expression and Sequence in Concert to Discover Regulatory Programs

Overview
Specialty Biochemistry
Date 2009 Jan 20
PMID 19151090
Citations 23
Abstract

A major goal of systems biology is the characterization of transcription factors and microRNAs (miRNAs) and the transcriptional programs they regulate. We present Allegro, a method for de novo discovery of cis-regulatory transcriptional programs through joint analysis of genome-wide expression data and promoter or 3' UTR sequences. The algorithm uses a novel log-likelihood-based, non-parametric model to describe the expression pattern shared by a group of co-regulated genes. We show that Allegro is more accurate and sensitive than existing techniques, and can simultaneously analyze multiple expression datasets with more than 100 conditions. We apply Allegro to datasets from several species and report the transcriptional modules it uncovers. Our analysis reveals a novel motif over-represented in the promoters of genes highly expressed in murine oocytes, and several new motifs related to fly development. Finally, using stem-cell expression profiles, we identify three miRNA families with pivotal roles in human embryogenesis.
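
As a rough illustration of what a log-likelihood-based, non-parametric expression model can look like, the sketch below discretizes expression values into a few levels, estimates per-condition level frequencies for a candidate gene set, and scores each gene by its log-likelihood ratio against background frequencies computed over all genes. The discretization thresholds, the pseudocount, the function names, and the toy data are assumptions made for illustration only; the abstract does not specify these details, and this is not the authors' implementation.

```python
import math

def discretize(value, low=-1.0, high=1.0):
    """Map an expression value to a level: 0 (down), 1 (unchanged), 2 (up).
    The thresholds are hypothetical."""
    if value <= low:
        return 0
    if value >= high:
        return 2
    return 1

def level_frequencies(profiles, n_levels=3, pseudocount=1.0):
    """Per-condition frequency of each level, smoothed with a pseudocount."""
    n_conditions = len(profiles[0])
    freqs = []
    for c in range(n_conditions):
        counts = [pseudocount] * n_levels
        for profile in profiles:
            counts[profile[c]] += 1
        total = sum(counts)
        freqs.append([x / total for x in counts])
    return freqs

def log_likelihood_ratio(profile, set_freqs, bg_freqs):
    """Sum over conditions of log(P(level | gene set) / P(level | background))."""
    return sum(
        math.log(set_freqs[c][level] / bg_freqs[c][level])
        for c, level in enumerate(profile)
    )

# Toy expression matrix: gene -> values across three conditions (made-up numbers).
expression = {
    "g1": [2.0, 1.5, -0.2],
    "g2": [1.8, 1.2, 0.1],
    "g3": [-1.5, 0.0, 0.3],
    "g4": [0.1, -1.2, 0.0],
    "g5": [-0.3, 0.2, 1.4],
    "g6": [0.0, -0.1, -1.6],
}
discrete = {g: [discretize(v) for v in vals] for g, vals in expression.items()}

# Hypothetical candidate set, e.g. genes whose promoters contain some motif.
candidate_genes = ["g1", "g2"]

set_freqs = level_frequencies([discrete[g] for g in candidate_genes])
bg_freqs = level_frequencies(list(discrete.values()))

scores = {g: log_likelihood_ratio(p, set_freqs, bg_freqs) for g, p in discrete.items()}
for gene in sorted(scores, key=scores.get, reverse=True):
    print(f"{gene}\t{scores[gene]:.3f}")
```

Genes whose discretized profile resembles the candidate set's pattern receive positive scores, and genes that follow the background receive scores near or below zero. No distributional form is assumed for the expression values, which is the sense in which such a model is non-parametric.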

Citing Articles

Quantifying the tissue-specific regulatory information within enhancer DNA sequences.

Benner P, Vingron M. NAR Genom Bioinform. 2021; 3(4):lqab095.

PMID: 34729474 PMC: 8557370. DOI: 10.1093/nargab/lqab095.


NF-Y Subunits Overexpression in HNSCC.

Bezzecchi E, Bernardini A, Ronzio M, Miccolo C, Chiocca S, Dolfini D. Cancers (Basel). 2021; 13(12).

PMID: 34208636 PMC: 8234210. DOI: 10.3390/cancers13123019.


NF-YA Overexpression in Lung Cancer: LUAD.

Bezzecchi E, Ronzio M, Semeghini V, Andrioletti V, Mantovani R, Dolfini D. Genes (Basel). 2020; 11(2).

PMID: 32075093 PMC: 7074112. DOI: 10.3390/genes11020198.


NF-YA Overexpression in Lung Cancer: LUSC.

Bezzecchi E, Ronzio M, Dolfini D, Mantovani R. Genes (Basel). 2019; 10(11).

PMID: 31744190 PMC: 6895822. DOI: 10.3390/genes10110937.


Analysis of Genomic Sequence Motifs for Deciphering Transcription Factor Binding and Transcriptional Regulation in Eukaryotic Cells.

Boeva V. Front Genet. 2016; 7:24.

PMID: 26941778 PMC: 4763482. DOI: 10.3389/fgene.2016.00024.

