
Thinking Process Templates for Constructing Data Stories with SCDNEY

Overview
Journal F1000Res
Date 2024 Mar 4
PMID 38434622
Abstract

Background: Scientists worldwide can now generate vast amounts of high-throughput biomedical data that carry critical information for clinical and public health applications. This data revolution in biology is producing a plethora of new single-cell datasets. Concurrently, there have been significant methodological advances in single-cell research. Integrating these two resources to create tailor-made, efficient, and purpose-specific data analysis approaches can help accelerate scientific discovery.

Methods: We developed a series of living workshops for building data stories using Single-cell data integrative analysis (scdney). scdney is a wrapper package that collects single-cell analysis R packages covering data integration, cell type annotation, higher-order testing and more.

Results: Here, we illustrate two specific workshops. The first workshop examines how to characterise the identity and/or state of cells and the relationship between them, known as phenotyping. The second workshop focuses on extracting higher-order features from cells to predict disease progression.

Conclusions: Through these workshops, we not only showcase current solutions but also highlight critical thinking points. In particular, we highlight the Thinking Process Template, which provides a structured framework for the decision-making process behind such single-cell analyses. Furthermore, our workshops will incorporate dynamic contributions from the community in a collaborative learning approach, hence the term 'living'.
