Reference-free Transcriptome Signatures for Prostate Cancer Prognosis
Overview
Affiliations
Background: RNA-seq data are increasingly used to derive prognostic signatures for cancer outcome prediction. A limitation of current predictors is their reliance on reference gene annotations, which amounts to ignoring large numbers of non-canonical RNAs produced in disease tissues. A recently introduced kind of transcriptome classifier operates entirely in a reference-free manner, relying on k-mers extracted from patient RNA-seq data.
Methods: In this paper, we set out to compare conventional and reference-free signatures in risk and relapse prediction of prostate cancer. To compare the two approaches as fairly as possible, we set up a common procedure that takes as input either a k-mer count matrix or a gene expression matrix, extracts a signature and evaluates this signature in an independent dataset.
Results: We find that both gene-based and k-mer based classifiers had similarly high performances for risk prediction and a markedly lower performance for relapse prediction. Interestingly, the reference-free signatures included a set of sequences mapping to novel lncRNAs or variable regions of cancer driver genes that were not part of gene-based signatures.
Conclusions: Reference-free classifiers are thus a promising strategy for the identification of novel prognostic RNA biomarkers.
Genomic strategies for drug repurposing.
Dave K, Patel D, Dave N, Jain M J Egypt Natl Canc Inst. 2024; 36(1):35.
PMID: 39523244 DOI: 10.1186/s43046-024-00245-z.
A survey of k-mer methods and applications in bioinformatics.
Moeckel C, Mareboina M, Konnaris M, Chan C, Mouratidis I, Montgomery A Comput Struct Biotechnol J. 2024; 23:2289-2303.
PMID: 38840832 PMC: 11152613. DOI: 10.1016/j.csbj.2024.05.025.
Wu M, Pan C, He Y, Yang B Cancer Manag Res. 2023; 15:1015-1024.
PMID: 37746314 PMC: 10516215. DOI: 10.2147/CMAR.S425181.
Mou Z, Spencer J, Knight B, John J, McCullagh P, McGrath J Front Oncol. 2022; 12:914078.
PMID: 36033512 PMC: 9413154. DOI: 10.3389/fonc.2022.914078.
Replicability in cancer omics data analysis: measures and empirical explorations.
Wang J, Liang H, Zhang Q, Ma S Brief Bioinform. 2022; 23(5).
PMID: 35876281 PMC: 9487717. DOI: 10.1093/bib/bbac304.