» Articles » PMID: 15285897

Maximum Entropy Modeling of Short Sequence Motifs with Applications to RNA Splicing Signals

Overview
Journal J Comput Biol
Date 2004 Aug 3
PMID 15285897
Citations 1086
Authors
Affiliations
Soon will be listed here.
Abstract

We propose a framework for modeling sequence motifs based on the maximum entropy principle (MEP). We recommend approximating short sequence motif distributions with the maximum entropy distribution (MED) consistent with low-order marginal constraints estimated from available data, which may include dependencies between nonadjacent as well as adjacent positions. Many maximum entropy models (MEMs) are specified by simply changing the set of constraints. Such models can be utilized to discriminate between signals and decoys. Classification performance using different MEMs gives insight into the relative importance of dependencies between different positions. We apply our framework to large datasets of RNA splicing signals. Our best models out-perform previous probabilistic models in the discrimination of human 5' (donor) and 3' (acceptor) splice sites from decoys. Finally, we discuss mechanistically motivated ways of comparing models.

Citing Articles

Exome sequencing identifies a homozygous splice site variant in as the underlying cause of autosomal recessive retinitis pigmentosa in a Pakistani family.

Rashid A, Munir A, Zahid M, Ullah M, Rehman A Ann Med. 2025; 57(1):2470953.

PMID: 40029043 PMC: 11878163. DOI: 10.1080/07853890.2025.2470953.


Precursor RNA structural patterns at SF3B1 mutation sensitive cryptic 3' splice sites.

Herbert A, Hatfield A, Randazza A, Miyamoto V, Palmer K, Lackey L bioRxiv. 2025; .

PMID: 40027643 PMC: 11870503. DOI: 10.1101/2025.02.19.638873.


Difference Analysis Among Six Kinds of Acceptor Splicing Sequences by the Dispersion Features of 6-mer Subsets in Human Genes.

Si Y, Li H, Li X Biology (Basel). 2025; 14(2).

PMID: 40001974 PMC: 11853274. DOI: 10.3390/biology14020206.


Sequence-dependent and -independent effects of intron-mediated enhancement learned from thousands of random introns.

Kowal E, Sakai Y, McGurk M, Pasetsky Z, Burge C Nucleic Acids Res. 2025; 53(4).

PMID: 39995040 PMC: 11850230. DOI: 10.1093/nar/gkaf097.


STIM1-mediated NFAT signaling synergizes with STAT1 to control T-bet expression and T1 differentiation.

Zhong L, Wang Y, Kahlfuss S, Jishage M, McDermott M, Yang J Nat Immunol. 2025; 26(3):484-496.

PMID: 39984734 DOI: 10.1038/s41590-025-02089-8.