» Articles » PMID: 27683156

Prediction and Validation of Protein-protein Interactors from Genome-wide DNA-binding Data Using a Knowledge-based Machine-learning Approach

Overview
Journal Open Biol
Date 2016 Sep 30
PMID 27683156
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

The ability to accurately predict the DNA targets and interacting cofactors of transcriptional regulators from genome-wide data can significantly advance our understanding of gene regulatory networks. NKX2-5 is a homeodomain transcription factor that sits high in the cardiac gene regulatory network and is essential for normal heart development. We previously identified genomic targets for NKX2-5 in mouse HL-1 atrial cardiomyocytes using DNA-adenine methyltransferase identification (DamID). Here, we apply machine learning algorithms and propose a knowledge-based feature selection method for predicting NKX2-5 protein : protein interactions based on motif grammar in genome-wide DNA-binding data. We assessed model performance using leave-one-out cross-validation and a completely independent DamID experiment performed with replicates. In addition to identifying previously described NKX2-5-interacting proteins, including GATA, HAND and TBX family members, a number of novel interactors were identified, with direct protein : protein interactions between NKX2-5 and retinoid X receptor (RXR), paired-related homeobox (PRRX) and Ikaros zinc fingers (IKZF) validated using the yeast two-hybrid assay. We also found that the interaction of RXRα with NKX2-5 mutations found in congenital heart disease (Q187H, R189G and R190H) was altered. These findings highlight an intuitive approach to accessing protein-protein interaction information of transcription factors in DNA-binding experiments.

Citing Articles

Mesoderm patterning by a dynamic gradient of retinoic acid signalling.

Bernheim S, Meilhac S Philos Trans R Soc Lond B Biol Sci. 2020; 375(1809):20190556.

PMID: 32829679 PMC: 7482219. DOI: 10.1098/rstb.2019.0556.


Predicting transcription factor binding using ensemble random forest models.

Behjati Ardakani F, Schmidt F, Schulz M F1000Res. 2019; 7:1603.

PMID: 31723409 PMC: 6823902. DOI: 10.12688/f1000research.16200.2.


Model-based and Model-free Machine Learning Techniques for Diagnostic Prediction and Classification of Clinical Outcomes in Parkinson's Disease.

Gao C, Sun H, Wang T, Tang M, Bohnen N, Muller M Sci Rep. 2018; 8(1):7129.

PMID: 29740058 PMC: 5940671. DOI: 10.1038/s41598-018-24783-4.

References
1.
von Both I, Silvestri C, Erdemir T, Lickert H, Walls J, Henkelman R . Foxh1 is essential for development of the anterior heart field. Dev Cell. 2004; 7(3):331-45. DOI: 10.1016/j.devcel.2004.07.023. View

2.
Li Z, Song C, Ouyang H, Lai L, Payne K, Dovat S . Cell cycle-specific function of Ikaros in human leukemia. Pediatr Blood Cancer. 2011; 59(1):69-76. PMC: 3292658. DOI: 10.1002/pbc.23406. View

3.
Hu Y, Luscher B, Admon A, Mermod N, Tjian R . Transcription factor AP-4 contains multiple dimerization domains that regulate dimer specificity. Genes Dev. 1990; 4(10):1741-52. DOI: 10.1101/gad.4.10.1741. View

4.
Narlikar L, Sakabe N, Blanski A, Arimura F, Westlund J, Nobrega M . Genome-wide discovery of human heart enhancers. Genome Res. 2010; 20(3):381-92. PMC: 2840982. DOI: 10.1101/gr.098657.109. View

5.
Kasahara H, Benson D . Biochemical analyses of eight NKX2.5 homeodomain missense mutations causing atrioventricular block and cardiac anomalies. Cardiovasc Res. 2004; 64(1):40-51. DOI: 10.1016/j.cardiores.2004.06.004. View