
Biological Data Annotation Via a Human-augmenting AI-based Labeling System

Overview
Journal NPJ Digit Med
Date 2021 Oct 8
PMID 34620993
Citations 9
Abstract

Biology has become a prime area for the deployment of deep learning and artificial intelligence (AI), enabled largely by the massive data sets that the field can generate. Key to most AI tasks is the availability of a sufficiently large, labeled data set with which to train AI models. In the context of microscopy, it is easy to generate image data sets containing millions of cells and structures. However, it is challenging to obtain large-scale, high-quality annotations for AI models. Here, we present HALS (Human-Augmenting Labeling System), a human-in-the-loop data labeling AI, which begins uninitialized and learns annotations from a human in real time. Using a multi-part AI composed of three deep learning models, HALS learns from just a few examples and immediately decreases the workload of the annotator, while increasing the quality of their annotations. Using a highly repetitive use case (annotating cell types) and running experiments with seven pathologists, who are experts at the microscopic analysis of biological specimens, we demonstrate a manual work reduction of 90.60% and an average data-quality boost of 4.34%, measured across four use cases and two tissue stain types.
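
To illustrate the general human-in-the-loop idea described above, the toy sketch below starts with an empty model, proposes a label for each item, and counts how often the human annotator has to correct it. This is not the HALS implementation: the abstract does not specify its three deep learning models or how workload reduction is measured, so the nearest-centroid "model", the one-dimensional features, the function names (`centroid_predict`, `run_labeling_session`), and the reduction formula are all assumptions made for illustration.

```python
from collections import defaultdict


def centroid_predict(centroids, x):
    """Return the label whose centroid is nearest to x, or None if nothing is learned yet."""
    if not centroids:
        return None
    return min(centroids, key=lambda label: abs(centroids[label] - x))


def run_labeling_session(items, oracle):
    """Simulate a labeling session with a model that starts uninitialized.

    items  : list of floats (hypothetical 1-D features, one per cell)
    oracle : function standing in for the human annotator's true label
    Returns (labels, manual_corrections, percent_reduction).
    """
    sums = defaultdict(float)   # running feature sum per label
    counts = defaultdict(int)   # number of examples seen per label
    labels = []
    manual = 0

    for x in items:
        # Rebuild centroids from everything learned so far.
        centroids = {label: sums[label] / counts[label] for label in counts}
        suggestion = centroid_predict(centroids, x)
        truth = oracle(x)

        # The human only does manual work when the suggestion is missing or wrong.
        if suggestion != truth:
            manual += 1

        # The model learns from the confirmed label immediately.
        labels.append(truth)
        sums[truth] += x
        counts[truth] += 1

    # Assumed definition: share of items that needed no manual correction.
    reduction = 100.0 * (1 - manual / len(items))
    return labels, manual, reduction


if __name__ == "__main__":
    features = [0.1, 0.9, 0.2, 0.8, 0.15, 0.85, 0.3, 0.7]
    annotator = lambda x: "tumor" if x > 0.5 else "stroma"  # hypothetical labels
    _, corrections, pct = run_labeling_session(features, annotator)
    print(f"manual corrections: {corrections}, reduction: {pct:.2f}%")
```

In this toy run the annotator has to label only the first example of each class by hand; every later suggestion is accepted. A real system would use learned image features and would also need to handle annotator disagreement, neither of which is modeled here.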

Citing Articles

A pathologist-AI collaboration framework for enhancing diagnostic accuracies and efficiencies.

Huang Z, Yang E, Shen J, Gratzinger D, Eyerer F, Liang B Nat Biomed Eng. 2024; .

PMID: 38898173 DOI: 10.1038/s41551-024-01223-5.


Boosting wisdom of the crowd for medical image annotation using training performance and task features.

Hasan E, Duhaime E, Trueblood J Cogn Res Princ Implic. 2024; 9(1):31.

PMID: 38763994 PMC: 11102897. DOI: 10.1186/s41235-024-00558-6.


Variational Autoencoders for Biomedical Signal Morphology Clustering and Noise Detection.

Nowroozilarki Z, Mortazavi B, Jafari R IEEE J Biomed Health Inform. 2023; PP.

PMID: 37768790 PMC: 10984704. DOI: 10.1109/JBHI.2023.3320585.


From function to translation: Decoding genetic susceptibility to human diseases via artificial intelligence.

Long E, Wan P, Chen Q, Lu Z, Choi J Cell Genom. 2023; 3(6):100320.

PMID: 37388909 PMC: 10300605. DOI: 10.1016/j.xgen.2023.100320.


Which data subset should be augmented for deep learning? a simulation study using urothelial cell carcinoma histopathology images.

Ameen Y, Badary D, Abonnoor A, Hussain K, Sewisy A BMC Bioinformatics. 2023; 24(1):75.

PMID: 36869300 PMC: 9983182. DOI: 10.1186/s12859-023-05199-y.

