» Articles » PMID: 37756695

A Cell-level Discriminative Neural Network Model for Diagnosis of Blood Cancers

Overview
Journal Bioinformatics
Specialty Biology
Date 2023 Sep 27
PMID 37756695
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Precise identification of cancer cells in patient samples is essential for accurate diagnosis and clinical monitoring but has been a significant challenge in machine learning approaches for cancer precision medicine. In most scenarios, training data are only available with disease annotation at the subject or sample level. Traditional approaches separate the classification process into multiple steps that are optimized independently. Recent methods either focus on predicting sample-level diagnosis without identifying individual pathologic cells or are less effective for identifying heterogeneous cancer cell phenotypes.

Results: We developed a generalized end-to-end differentiable model, the Cell Scoring Neural Network (CSNN), which takes sample-level training data and predicts the diagnosis of the testing samples and the identity of the diagnostic cells in the sample, simultaneously. The cell-level density differences between samples are linked to the sample diagnosis, which allows the probabilities of individual cells being diagnostic to be calculated using backpropagation. We applied CSNN to two independent clinical flow cytometry datasets for leukemia diagnosis. In both qualitative and quantitative assessments, CSNN outperformed preexisting neural network modeling approaches for both cancer diagnosis and cell-level classification. Post hoc decision trees and 2D dot plots were generated for interpretation of the identified cancer cells, showing that the identified cell phenotypes match the cancer endotypes observed clinically in patient cohorts. Independent data clustering analysis confirmed the identified cancer cell populations.

Availability And Implementation: The source code of CSNN and datasets used in the experiments are publicly available on GitHub (http://github.com/erobl/csnn). Raw FCS files can be downloaded from FlowRepository (ID: FR-FCM-Z6YK).

Citing Articles

Deep Learning in Hematology: From Molecules to Patients.

Wang J Clin Hematol Int. 2024; 6(4):19-42.

PMID: 39417017 PMC: 11477942. DOI: 10.46989/001c.124131.

References
1.
Monaghan S, Li J, Liu Y, Ko M, Boyiadzis M, Chang T . A Machine Learning Approach to the Classification of Acute Leukemias and Distinction From Nonneoplastic Cytopenias Using Flow Cytometry Data. Am J Clin Pathol. 2021; 157(4):546-553. DOI: 10.1093/ajcp/aqab148. View

2.
Shekhar K, Brodin P, Davis M, Chakraborty A . Automatic Classification of Cellular Expression by Nonlinear Stochastic Embedding (ACCENSE). Proc Natl Acad Sci U S A. 2013; 111(1):202-7. PMC: 3890841. DOI: 10.1073/pnas.1321405111. View

3.
Lazarus M, Turner-Stokes T, Chavele K, Isenberg D, Ehrenstein M . B-cell numbers and phenotype at clinical relapse following rituximab therapy differ in SLE patients according to anti-dsDNA antibody levels. Rheumatology (Oxford). 2012; 51(7):1208-15. PMC: 3380246. DOI: 10.1093/rheumatology/ker526. View

4.
Ji D, Putzel P, Qian Y, Chang I, Mandava A, Scheuermann R . Machine Learning of Discriminative Gate Locations for Clinical Diagnosis. Cytometry A. 2019; 97(3):296-307. PMC: 7079150. DOI: 10.1002/cyto.a.23906. View

5.
Lux M, Brinkman R, Chauve C, Laing A, Lorenc A, Abeler-Dorner L . flowLearn: fast and precise identification and quality checking of cell populations in flow cytometry. Bioinformatics. 2018; 34(13):2245-2253. PMC: 6022609. DOI: 10.1093/bioinformatics/bty082. View