
Ranking Transitive Chemical-disease Inferences Using Local Network Topology in the Comparative Toxicogenomics Database

Overview
Journal: PLoS One
Date: 2012 Nov 13
PMID: 23144783
Citations: 28
Abstract

Exposure to chemicals in the environment is believed to play a critical role in the etiology of many human diseases. To enhance understanding of environmental effects on human health, the Comparative Toxicogenomics Database (CTD; http://ctdbase.org) provides unique curated data that enable the development of novel hypotheses about the relationships between chemicals and diseases. CTD biocurators read the literature and curate direct relationships between chemicals and genes, genes and diseases, and chemicals and diseases. These direct relationships are then computationally integrated to create additional inferred relationships; for example, a direct chemical-gene statement can be combined with a direct gene-disease statement to generate a chemical-disease inference (inferred via the shared gene). In CTD, the number of inferences has increased exponentially as the number of direct chemical, gene and disease interactions has grown. To help users navigate and prioritize these inferences for hypothesis development, we implemented a statistic that scores and ranks them based on the topology of the local network consisting of the chemical, the disease and each of the genes used to make the inference. In this network, chemicals, diseases and genes are nodes connected by edges that represent the curated interactions. As in other biological networks, node connectivity is an important consideration when evaluating the CTD network, because the connectivity of its nodes follows a power-law distribution. Topological methods reduce the influence of the highly connected nodes that are present in biological networks. We evaluated published methods that use local network topology to determine the reliability of protein-protein interactions derived from high-throughput assays. We then developed a new metric that combines and weights two of these methods and uniquely takes into account both the number of common neighbors and the connectivity of each entity involved. We present several CTD inferences as case studies to demonstrate the value of this metric and the biological relevance of the inferences.
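
The inference step and the idea of topology-based ranking described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the chemical, gene and disease names are hypothetical rather than CTD data, only gene neighbors are counted, and the score is a plain Jaccard-style ratio of common neighbors to all neighbors. It is not the metric developed in the paper, which combines and weights two published methods that the abstract does not specify.

```python
from collections import defaultdict

# Hypothetical curated statements (illustrative only, not CTD data).
CHEM_GENE = [("chemA", "g1"), ("chemA", "g2"), ("chemB", "g2"), ("chemB", "g3")]
GENE_DISEASE = [("g1", "disX"), ("g2", "disX"), ("g2", "disY"), ("g3", "disY")]


def build_neighbors(chem_gene, gene_disease):
    """Map each chemical and each disease to its set of directly curated genes."""
    genes_of_chem = defaultdict(set)
    for chem, gene in chem_gene:
        genes_of_chem[chem].add(gene)
    genes_of_disease = defaultdict(set)
    for gene, disease in gene_disease:
        genes_of_disease[disease].add(gene)
    return genes_of_chem, genes_of_disease


def infer(genes_of_chem, genes_of_disease):
    """Transitive inference: a chemical-disease pair is inferred via shared genes."""
    inferences = {}
    for chem, chem_genes in genes_of_chem.items():
        for disease, disease_genes in genes_of_disease.items():
            shared = chem_genes & disease_genes
            if shared:
                inferences[(chem, disease)] = shared
    return inferences


def jaccard_score(chem_genes, disease_genes):
    """Assumed stand-in score: common neighbors divided by all neighbors.

    Dividing by the union penalizes highly connected nodes, which would
    otherwise collect many inferences simply because they have many edges.
    """
    union = chem_genes | disease_genes
    return len(chem_genes & disease_genes) / len(union) if union else 0.0


def rank_inferences(chem_gene, gene_disease):
    """Return (pair, score) tuples sorted by descending score, then by pair."""
    genes_of_chem, genes_of_disease = build_neighbors(chem_gene, gene_disease)
    scored = [
        (pair, jaccard_score(genes_of_chem[pair[0]], genes_of_disease[pair[1]]))
        for pair in infer(genes_of_chem, genes_of_disease)
    ]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


ranked = rank_inferences(CHEM_GENE, GENE_DISEASE)
for (chem, disease), score in ranked:
    print(f"{chem} -> {disease}: {score:.2f}")
```

In this toy network, a pair whose chemical and disease share all of their genes ranks above a pair that shares only one gene out of three. That mirrors, in a simplified way, how a local-topology score can separate well-supported inferences from those driven by a single highly connected gene.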

Citing Articles

Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.

Wiegers T, Davis A, Wiegers J, Sciaky D, Barkalow F, Wyatt B. Database (Oxford). 2025; 2025.

PMID: 39982792 PMC: 11844237. DOI: 10.1093/database/baaf013.


Per-and polyfluoroalkyl substances and disrupted sleep: mediating roles of proteins.

Li S, Goodrich J, Chen J, Costello E, Beglarian E, Liao J. Environ Adv. 2024; 17.

PMID: 39512894 PMC: 11542765. DOI: 10.1016/j.envadv.2024.100585.


Involvement of , , and genes in breast cancer and muscle cell development.

Dastsooz H, Anselmi F, Lauria A, Cicconetti C, Proserpio V, Mohammadisoleimani E. Front Cell Dev Biol. 2024; 12:1295403.

PMID: 38859961 PMC: 11163233. DOI: 10.3389/fcell.2024.1295403.


Global comparative transcriptomes uncover novel and population-specific gene expression in esophageal squamous cell carcinoma.

Alotaibi A, Gadekar V, Gundla P, Mandarthi S, Jayendra N, Tungekar A. Infect Agent Cancer. 2023; 18(1):47.

PMID: 37641095 PMC: 10463703. DOI: 10.1186/s13027-023-00525-8.


CTD tetramers: a new online tool that computationally links curated chemicals, genes, phenotypes, and diseases to inform molecular mechanisms for environmental health.

Davis A, Wiegers T, Wiegers J, Wyatt B, Johnson R, Sciaky D. Toxicol Sci. 2023; 195(2):155-168.

PMID: 37486259 PMC: 10535784. DOI: 10.1093/toxsci/kfad069.

