» Articles » PMID: 30191079

Generalized Score Functions for Causal Discovery

Overview
Journal KDD
Date 2018 Sep 8
PMID 30191079
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Discovery of causal relationships from observational data is a fundamental problem. Roughly speaking, there are two types of methods for causal discovery, constraint-based ones and score-based ones. Score-based methods avoid the multiple testing problem and enjoy certain advantages compared to constraint-based ones. However, most of them need strong assumptions on the functional forms of causal mechanisms, as well as on data distributions, which limit their applicability. In practice the precise information of the underlying model class is usually unknown. If the above assumptions are violated, both spurious and missing edges may result. In this paper, we introduce generalized score functions for causal discovery based on the characterization of general (conditional) independence relationships between random variables, without assuming particular model classes. In particular, we exploit regression in RKHS to capture the dependence in a non-parametric way. The resulting causal discovery approach produces asymptotically correct results in rather general cases, which may have nonlinear causal mechanisms, a wide class of data distributions, mixed continuous and discrete data, and multidimensional variables. Experimental results on both synthetic and real-world data demonstrate the efficacy of our proposed approach.

Citing Articles

A large-scale benchmark for network inference from single-cell perturbation data.

Chevalley M, Roohani Y, Mehrjou A, Leskovec J, Schwab P Commun Biol. 2025; 8(1):412.

PMID: 40069299 PMC: 11897147. DOI: 10.1038/s42003-025-07764-y.


Causal Inference for Hypertension Prediction With Wearable E lectrocardiogram and P hotoplethysmogram Signals: Feasibility Study.

Gon G K, Chen Y, Song X, Fu Z, Ding X JMIR Cardio. 2025; 9:e60238.

PMID: 39864408 PMC: 11811217. DOI: 10.2196/60238.


Mixed-variable graphical modeling framework towards risk prediction of hospital-acquired pressure injury in spinal cord injury individuals.

Li Y, Scheel-Sailer A, Riener R, Paez-Granados D Sci Rep. 2024; 14(1):25067.

PMID: 39443567 PMC: 11499609. DOI: 10.1038/s41598-024-75691-9.


Graphical modeling of causal factors associated with the postoperative survival of esophageal cancer subjects.

Ren S, Beeche C, Iyer K, Shi Z, Auster Q, Hawkins J Med Phys. 2023; 51(3):1997-2006.

PMID: 37523254 PMC: 10828112. DOI: 10.1002/mp.16656.


A Graph-Based Approach to Identify Factors Contributing to Postoperative Lung Cancer Recurrence among Patients with Non-Small-Cell Lung Cancer.

Iyer K, Ren S, Pu L, Mazur S, Zhao X, Dhupar R Cancers (Basel). 2023; 15(13).

PMID: 37444581 PMC: 10340686. DOI: 10.3390/cancers15133472.


References
1.
Hyvarinen A, Smith S . Pairwise Likelihood Ratios for Estimation of Non-Gaussian Structural Equation Models. J Mach Learn Res. 2019; 14(Jan):111-152. PMC: 6834441. View

2.
Imoto S, Goto T, Miyano S . Estimation of genetic networks and functional structures between genes by using Bayesian networks and nonparametric regression. Pac Symp Biocomput. 2002; :175-86. View

3.
Spirtes P, Zhang K . Causal discovery and inference: concepts and recent methodological advances. Appl Inform (Berl). 2016; 3:3. PMC: 4841209. DOI: 10.1186/s40535-016-0018-x. View

4.
Bakken T, Dale A, Schork N . A geographic cline of skull and brain morphology among individuals of European Ancestry. Hum Hered. 2011; 72(1):35-44. PMC: 3171282. DOI: 10.1159/000330168. View

5.
Zhang K, Scholkopf B, Spirtes P, Glymour C . Learning causality and causality-related learning: some recent progress. Natl Sci Rev. 2018; 5(1):26-29. PMC: 6051411. DOI: 10.1093/nsr/nwx137. View