» Articles » PMID: 38441258

Uncertainty-aware Single-cell Annotation with a Hierarchical Reject Option

Overview
Journal Bioinformatics
Specialty Biology
Date 2024 Mar 5
PMID 38441258
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Automatic cell type annotation methods assign cell type labels to new datasets by extracting relationships from a reference RNA-seq dataset. However, due to the limited resolution of gene expression features, there is always uncertainty present in the label assignment. To enhance the reliability and robustness of annotation, most machine learning methods address this uncertainty by providing a full reject option, i.e. when the predicted confidence score of a cell type label falls below a user-defined threshold, no label is assigned and no prediction is made. As a better alternative, some methods deploy hierarchical models and consider a so-called partial rejection by returning internal nodes of the hierarchy as label assignment. However, because a detailed experimental analysis of various rejection approaches is missing in the literature, there is currently no consensus on best practices.

Results: We evaluate three annotation approaches (i) full rejection, (ii) partial rejection, and (iii) no rejection for both flat and hierarchical probabilistic classifiers. Our findings indicate that hierarchical classifiers are superior when rejection is applied, with partial rejection being the preferred rejection approach, as it preserves a significant amount of label information. For optimal rejection implementation, the rejection threshold should be determined through careful examination of a method's rejection behavior. Without rejection, flat and hierarchical annotation perform equally well, as long as the cell type hierarchy accurately captures transcriptomic relationships.

Availability And Implementation: Code is freely available at https://github.com/Latheuni/Hierarchical_reject and https://doi.org/10.5281/zenodo.10697468.

References
1.
Li H, Janssens J, De Waegeneer M, Kolluru S, Davie K, Gardeux V . Fly Cell Atlas: A single-nucleus transcriptomic atlas of the adult fruit fly. Science. 2022; 375(6584):eabk2432. PMC: 8944923. DOI: 10.1126/science.abk2432. View

2.
Costa M, Reeve S, Grumbling G, Osumi-Sutherland D . The Drosophila anatomy ontology. J Biomed Semantics. 2013; 4(1):32. PMC: 4015547. DOI: 10.1186/2041-1480-4-32. View

3.
Wang S, Pisco A, McGeever A, Brbic M, Zitnik M, Darmanis S . Leveraging the Cell Ontology to classify unseen cell types. Nat Commun. 2021; 12(1):5556. PMC: 8455606. DOI: 10.1038/s41467-021-25725-x. View

4.
Kaymaz Y, Ganglberger F, Tang M, Haslinger C, Fernandez-Albert F, Lawless N . HieRFIT: a hierarchical cell type classification tool for projections from complex single-cell atlas datasets. Bioinformatics. 2021; 37(23):4431-4436. DOI: 10.1093/bioinformatics/btab499. View

5.
Cao Z, Wei L, Lu S, Yang D, Gao G . Searching large-scale scRNA-seq databases via unbiased cell embedding with Cell BLAST. Nat Commun. 2020; 11(1):3458. PMC: 7351785. DOI: 10.1038/s41467-020-17281-7. View