
Medical Big Data: Promise and Challenges

Overview
Specialty Nephrology
Date 2017 Apr 11
PMID 28392994
Citations 108
Abstract

The concept of big data, commonly characterized by volume, variety, velocity, and veracity, goes far beyond the type of data and extends to how the data are analyzed: the approach is typically hypothesis-generating rather than hypothesis-testing. Big data analysis focuses on the temporal stability of an association rather than on causal relationships, and assumptions about the underlying probability distribution are frequently not required. Medical big data, as material to be analyzed, has various features that distinguish it not only from big data in other disciplines but also from the data of traditional clinical epidemiology. Big data technology has many areas of application in healthcare, such as predictive modeling and clinical decision support, disease or safety surveillance, public health, and research. Big data analytics frequently exploits analytic methods developed in data mining, including classification, clustering, and regression. Medical big data analyses are complicated by many technical issues, such as missing values, the curse of dimensionality, and bias control, and they share the inherent limitations of observational studies, namely the inability to test causality because of residual confounding and reverse causation. Recently, propensity score analysis and instrumental variable analysis have been introduced to address these limitations, with considerable success. Many challenges, such as the absence of evidence of practical benefits of big data, methodological issues including legal and ethical issues, and clinical integration and utility issues, must be overcome to realize the promise of medical big data as the fuel of a continuous learning healthcare system that will improve patient outcomes and reduce waste in areas including nephrology.
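The abstract's point about residual confounding and instrumental variable analysis can be illustrated with a small simulation. The sketch below is not taken from the article: the data are synthetic, and the function names (`simulate`, `ols_slope`, `iv_slope`), the coefficients, and the true effect of 2.0 are all assumptions chosen for illustration. It compares a naive regression slope with a simple instrumental variable (Wald-type ratio) estimate when an unmeasured confounder affects both exposure and outcome.

```python
import random


def simulate(n=20000, true_effect=2.0, seed=0):
    """Generate synthetic (hypothetical) data with an unmeasured confounder u."""
    rng = random.Random(seed)
    z, x, y = [], [], []
    for _ in range(n):
        u = rng.gauss(0, 1)    # unmeasured confounder: drives both x and y
        zi = rng.gauss(0, 1)   # instrument: affects x, has no direct path to y
        xi = 0.8 * zi + u + rng.gauss(0, 1)                # exposure
        yi = true_effect * xi + 3.0 * u + rng.gauss(0, 1)  # outcome
        z.append(zi)
        x.append(xi)
        y.append(yi)
    return z, x, y


def covariance(a, b):
    """Sample covariance of two equal-length sequences."""
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    return sum((ai - mean_a) * (bi - mean_b) for ai, bi in zip(a, b)) / (len(a) - 1)


def ols_slope(x, y):
    """Naive regression slope of y on x; biased when u is omitted."""
    return covariance(x, y) / covariance(x, x)


def iv_slope(z, x, y):
    """Instrumental variable (ratio) estimate: cov(z, y) / cov(z, x)."""
    return covariance(z, y) / covariance(z, x)


if __name__ == "__main__":
    z, x, y = simulate()
    print("naive slope:", round(ols_slope(x, y), 3))
    print("IV slope:   ", round(iv_slope(z, x, y), 3))
```

Under these assumed coefficients the naive slope overstates the effect (roughly 3.1 in large samples, since the confounder contributes 3/2.64 to the slope), while the instrumental variable estimate stays close to the assumed true effect of 2.0. The estimate is only as good as the instrument: it relies on `z` influencing the outcome solely through the exposure, which holds here by construction but must be argued in real observational data.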

Citing Articles

Clinical validation and optimization of machine learning models for early prediction of sepsis.

Liu X, Li M, Liu X, Luo Y, Yang D, Ouyang H Front Med (Lausanne). 2025; 12:1521660.

PMID: 39975676 PMC: 11836818. DOI: 10.3389/fmed.2025.1521660.


The impacts on population health by China's regional health data centers and the potential mechanism of influence.

Cai J, Li Y, Coyte P Digit Health. 2025; 11:20552076251314102.

PMID: 39830144 PMC: 11742170. DOI: 10.1177/20552076251314102.


Detection of Disease Features on Retinal OCT Scans Using RETFound.

Du K, Nair A, Shah S, Gadari A, Vupparaboina S, Bollepalli S Bioengineering (Basel). 2025; 11(12.

PMID: 39768004 PMC: 11672910. DOI: 10.3390/bioengineering11121186.


Synthetic Breast Ultrasound Images: A Study to Overcome Medical Data Sharing Barriers.

Xu J, Hua Q, Jia X, Zheng Y, Hu Q, Bai B Research (Wash D C). 2024; 7:0532.

PMID: 39628833 PMC: 11612121. DOI: 10.34133/research.0532.


Information Mode-Dependent Success Rates of Obtaining German Medical Informatics Initiative-Compliant Broad Consent in the Emergency Department: Single-Center Prospective Observational Study.

Hans F, Kleinekort J, Boerries M, Nieters A, Kindle G, Rautenberg M JMIR Med Inform. 2024; 12:e65646.

PMID: 39626089 PMC: 11688594. DOI: 10.2196/65646.

