Design Principles for Clinical Network-based Proteomics
Overview
Authors
Affiliations
Integrating biological networks with proteomics is a tantalizing option for system-level analysis; for example it can help remove false-positives from proteomics data and improve coverage by detecting false-negatives, as well as resolving inconsistent inter-sample protein expression due to biological heterogeneity. Yet, designing a robust network-based analysis strategy on proteomics data is nontrivial. The issues include dealing with test set bias caused by, for example, inappropriate normalization procedure, devising appropriate benchmarking criteria and formulating statistically robust feature-selection techniques. Given the increasing importance of proteomics in contemporary clinical studies, more powerful network-based approaches are needed. We provide some design principles and considerations that can help achieve this, while taking into account the idiosyncrasies of proteomics data.
Resolving missing protein problems using functional class scoring.
Wong B, Kong W, Wong L, Goh W Sci Rep. 2022; 12(1):11358.
PMID: 35790756 PMC: 9256666. DOI: 10.1038/s41598-022-15314-3.
Analysing omics data sets with weighted nodes networks (WNNets).
Tosadori G, Silvestre D, Spoto F, Mauri P, Laudanna C, Scardoni G Sci Rep. 2021; 11(1):14447.
PMID: 34262093 PMC: 8280138. DOI: 10.1038/s41598-021-93699-3.
Avoid Oversimplifications in Machine Learning: Going beyond the Class-Prediction Accuracy.
Ho S, Wong L, Goh W Patterns (N Y). 2020; 1(2):100025.
PMID: 33205097 PMC: 7660406. DOI: 10.1016/j.patter.2020.100025.
The Birth of Bio-data Science: Trends, Expectations, and Applications.
Goh W, Wong L Genomics Proteomics Bioinformatics. 2020; 18(1):5-15.
PMID: 32428604 PMC: 7393550. DOI: 10.1016/j.gpb.2020.01.002.
Goh W, Sng J, Yee J, See Y, Lee T, Wong L Comput Psychiatr. 2018; 1:168-183.
PMID: 30090857 PMC: 6067827. DOI: 10.1162/CPSY_a_00007.