Machine Learning in Medicine: a Practical Introduction

Overview

Journal BMC Med Res Methodol

Publisher Biomed Central

Specialties General Medicine
Health Services

Date 2019 Mar 21

PMID 30890124

Citations 334

Authors

Jenni A M Sidey-Gibbons

Chris J Sidey-Gibbons

Affiliations

Soon will be listed here.

Abstract

Background: Following visible successes on a wide range of predictive tasks, machine learning techniques are attracting substantial interest from medical researchers and clinicians. We address the need for capacity development in this area by providing a conceptual introduction to machine learning alongside a practical guide to developing and evaluating predictive algorithms using freely-available open source software and public domain data.

Methods: We demonstrate the use of machine learning techniques by developing three predictive models for cancer diagnosis using descriptions of nuclei sampled from breast masses. These algorithms include regularized General Linear Model regression (GLMs), Support Vector Machines (SVMs) with a radial basis function kernel, and single-layer Artificial Neural Networks. The publicly-available dataset describing the breast mass samples (N=683) was randomly split into evaluation (n=456) and validation (n=227) samples. We trained algorithms on data from the evaluation sample before they were used to predict the diagnostic outcome in the validation dataset. We compared the predictions made on the validation datasets with the real-world diagnostic decisions to calculate the accuracy, sensitivity, and specificity of the three models. We explored the use of averaging and voting ensembles to improve predictive performance. We provide a step-by-step guide to developing algorithms using the open-source R statistical programming environment.

Results: The trained algorithms were able to classify cell nuclei with high accuracy (.94 -.96), sensitivity (.97 -.99), and specificity (.85 -.94). Maximum accuracy (.96) and area under the curve (.97) was achieved using the SVM algorithm. Prediction performance increased marginally (accuracy =.97, sensitivity =.99, specificity =.95) when algorithms were arranged into a voting ensemble.

Conclusions: We use a straightforward example to demonstrate the theory and practice of machine learning for clinicians and medical researchers. The principals which we demonstrate here can be readily applied to other complex tasks including natural language processing and image recognition.

Citing Articles

Predicting total healthcare demand using machine learning: separate and combined analysis of predisposing, enabling, and need factors.

Orhan F, Kurutkan M BMC Health Serv Res. 2025; 25(1):366.

PMID: 40075408 PMC: 11900254. DOI: 10.1186/s12913-025-12502-5.

Stacking Model-Based Classifiers for Dealing With Multiple Sets of Noisy Labels.

Montani G, Cappozzo A Biom J. 2025; 67(2):e70042.

PMID: 40071867 PMC: 11898607. DOI: 10.1002/bimj.70042.

Advanced applications in chronic disease monitoring using IoT mobile sensing device data, machine learning algorithms and frame theory: a systematic review.

Liu Y, Wang B Front Public Health. 2025; 13:1510456.

PMID: 40061474 PMC: 11885302. DOI: 10.3389/fpubh.2025.1510456.

A bibliometric analysis of the advance of artificial intelligence in medicine.

Lin M, Lin L, Lin L, Lin Z, Yan X Front Med (Lausanne). 2025; 12:1504428.

PMID: 40061376 PMC: 11885233. DOI: 10.3389/fmed.2025.1504428.

Applications of Artificial Intelligence for the Prediction and Diagnosis of Cancer Therapy-Related Cardiac Dysfunction in Oncology Patients.

Scalia I, Pathangey G, Abdelnabi M, Ibrahim O, Abdelfattah F, Pereyra Pietri M Cancers (Basel). 2025; 17(4).

PMID: 40002200 PMC: 11852369. DOI: 10.3390/cancers17040605.

References

Bland J, Altman D . Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986; 1(8476):307-10. View

WOLBERG W, Mangasarian O . Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci U S A. 1990; 87(23):9193-6. PMC: 55130. DOI: 10.1073/pnas.87.23.9193. View

Wagland R, Recio-Saucedo A, Simon M, Bracher M, Hunt K, Foster C . Development and testing of a text-mining approach to analyse patients' comments on their experiences of colorectal cancer care. BMJ Qual Saf. 2015; 25(8):604-14. DOI: 10.1136/bmjqs-2015-004063. View

Hawkins J, Brownstein J, Tuli G, Runels T, Broecker K, Nsoesie E . Measuring patient-perceived quality of care in US hospitals using Twitter. BMJ Qual Saf. 2015; 25(6):404-13. PMC: 4878682. DOI: 10.1136/bmjqs-2015-004309. View

Gibbons C, Richards S, Valderas J, Campbell J . Supervised Machine Learning Algorithms Can Classify Open-Text Feedback of Doctor Performance With Human-Level Accuracy. J Med Internet Res. 2017; 19(3):e65. PMC: 5371715. DOI: 10.2196/jmir.6533. View

Banerjee S, Zare R, Tibshirani R, Kunder C, Nolley R, Fan R . Diagnosis of prostate cancer by desorption electrospray ionization mass spectrometric imaging of small metabolites and lipids. Proc Natl Acad Sci U S A. 2017; 114(13):3334-3339. PMC: 5380053. DOI: 10.1073/pnas.1700677114. View

Kosinski M, Stillwell D, Graepel T . Private traits and attributes are predictable from digital records of human behavior. Proc Natl Acad Sci U S A. 2013; 110(15):5802-5. PMC: 3625324. DOI: 10.1073/pnas.1218772110. View

Darcy A, Louie A, Roberts L . Machine Learning and the Profession of Medicine. JAMA. 2016; 315(6):551-2. DOI: 10.1001/jama.2015.18421. View

Friedman C, Wong A, Blumenthal D . Achieving a nationwide learning health system. Sci Transl Med. 2010; 2(57):57cm29. DOI: 10.1126/scitranslmed.3001456. View

10.

Jordan M, Mitchell T . Machine learning: Trends, perspectives, and prospects. Science. 2015; 349(6245):255-60. DOI: 10.1126/science.aaa8415. View

11.

Ong M, Magrabi F, Coiera E . Automated identification of extreme-risk events in clinical incident reports. J Am Med Inform Assoc. 2012; 19(e1):e110-8. PMC: 3392867. DOI: 10.1136/amiajnl-2011-000562. View

12.

Beam A, Kohane I . Big Data and Machine Learning in Health Care. JAMA. 2018; 319(13):1317-1318. DOI: 10.1001/jama.2017.18391. View

13.

Esteva A, Kuprel B, Novoa R, Ko J, Swetter S, Blau H . Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017; 542(7639):115-118. PMC: 8382232. DOI: 10.1038/nature21056. View

14.

Anderson J, Parikh J, Shenfeld D, Ivanov V, Marks C, Church B . Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes: An Application of Machine Learning Using Electronic Health Records. J Diabetes Sci Technol. 2015; 10(1):6-18. PMC: 4738229. DOI: 10.1177/1932296815620200. View

15.

Hanley J, McNeil B . The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982; 143(1):29-36. DOI: 10.1148/radiology.143.1.7063747. View

16.

Greaves F, Ramirez-Cano D, Millett C, Darzi A, Donaldson L . Use of sentiment analysis for capturing patient experience from free-text comments posted online. J Med Internet Res. 2013; 15(11):e239. PMC: 3841376. DOI: 10.2196/jmir.2721. View

17.

WOLBERG W, Street W, Mangasarian O . Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 1994; 77(2-3):163-71. DOI: 10.1016/0304-3835(94)90099-x. View

18.

Lazer D, Kennedy R, King G, Vespignani A . Big data. The parable of Google Flu: traps in big data analysis. Science. 2014; 343(6176):1203-5. DOI: 10.1126/science.1248506. View

19.

Bedi G, Carrillo F, Cecchi G, Fernandez Slezak D, Sigman M, Mota N . Automated analysis of free speech predicts psychosis onset in high-risk youths. NPJ Schizophr. 2016; 1:15030. PMC: 4849456. DOI: 10.1038/npjschz.2015.30. View

20.

Haider A, Chang D, Efron D, Haut E, Crandall M, Cornwell 3rd E . Race and insurance status as risk factors for trauma mortality. Arch Surg. 2008; 143(10):945-9. DOI: 10.1001/archsurg.143.10.945. View