
Multi-TGDR, a Multi-class Regularization Method, Identifies the Metabolic Profiles of Hepatocellular Carcinoma and Cirrhosis Infected with Hepatitis B or Hepatitis C Virus

Overview
Publisher Biomed Central
Specialty Biology
Date 2014 Apr 9
PMID 24707821
Citations 9
Abstract

Background: Over the last decade, metabolomics has evolved into a mainstream enterprise utilized by many laboratories globally. Like other "omics" data, metabolomics data typically have far fewer samples than measured features. Selecting an optimal subset of features for a supervised classifier is therefore imperative. We extended an existing feature selection algorithm, threshold gradient descent regularization (TGDR), to handle multi-class classification of "omics" data, and proposed two such extensions, both referred to as multi-TGDR. Both multi-TGDR frameworks were used to analyze a metabolomics dataset that compares the metabolic profiles of hepatocellular carcinoma (HCC) in patients infected with hepatitis B virus (HBV) or hepatitis C virus (HCV) with those of cirrhosis induced by HBV/HCV infection; the goal was to improve early-stage diagnosis of HCC.

Results: We applied the two multi-TGDR frameworks, which determine the TGDR thresholds either globally across classes or locally for each class, to the HCC metabolomics data. The multi-TGDR global model selected 45 metabolites with a 0% misclassification rate (the error rate on the training data) and a 3.82% 5-fold cross-validation (CV-5) predictive error rate. The multi-TGDR local model selected 48 metabolites with a 0% misclassification rate and a 5.34% CV-5 error rate.
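
The difference between determining thresholds globally and locally can be illustrated with a small sketch. It is not the authors' implementation. It assumes the usual TGDR rule, in which a coefficient is updated only if the absolute value of its gradient is at least a fraction tau of the largest absolute gradient, and it assumes that "global" takes that maximum over all classes while "local" takes it within each class separately. The function names, the step size, and the toy gradient values are hypothetical.

```python
def tgdr_mask(grad, tau, mode="global"):
    """Return 0/1 indicators of which coefficients would be updated.

    grad: list of rows, one row of per-feature gradients per class.
    tau:  threshold in [0, 1]; 0 updates every coefficient, 1 only the largest.
    mode: "global" compares against the max |gradient| over all classes
          (assumed reading of the global variant), "local" against the max
          within each class's own row (assumed reading of the local variant).
    """
    abs_grad = [[abs(g) for g in row] for row in grad]
    global_max = max(max(row) for row in abs_grad)
    mask = []
    for row in abs_grad:
        ref = global_max if mode == "global" else max(row)
        mask.append([1 if a >= tau * ref else 0 for a in row])
    return mask


def tgdr_step(beta, grad, tau, step=0.1, mode="global"):
    """One thresholded update: only coefficients that pass the mask move."""
    mask = tgdr_mask(grad, tau, mode)
    return [[b + step * g * m for b, g, m in zip(b_row, g_row, m_row)]
            for b_row, g_row, m_row in zip(beta, grad, mask)]


# Toy gradients for two classes and three features (made-up numbers).
grad = [[4.0, 1.0, 0.5],
        [0.2, 0.9, 0.1]]
global_mask = tgdr_mask(grad, tau=0.5, mode="global")
local_mask = tgdr_mask(grad, tau=0.5, mode="local")
print(global_mask)
print(local_mask)
```

In this toy case the global rule leaves the second class with no updated coefficient, because its gradients are small relative to the first class, whereas the local rule still updates the strongest feature within each class. Under the stated assumptions, that is one way the two variants can end up selecting different feature sets.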

Conclusions: One important advantage of multi-TGDR local is that it allows inference about which features are related specifically to a given class or classes. We therefore recommend multi-TGDR local: it has similar predictive performance and requires the same computing time as multi-TGDR global, but may also provide class-specific inference.
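
As a rough illustration of what class-specific inference could look like in practice, the sketch below reads a fitted coefficient matrix and lists, for each class, the features with a non-zero coefficient. The matrix layout (one row per class, one column per feature), the function name, and the class and metabolite labels are assumptions made for this example and do not come from the paper.

```python
def class_specific_features(beta, feature_names, class_names, tol=0.0):
    """Map each class to the features whose coefficient is non-zero for it.

    beta is assumed to hold one row of coefficients per class; a feature is
    counted as selected for a class when |coefficient| exceeds tol.
    """
    selected = {}
    for cls, row in zip(class_names, beta):
        selected[cls] = [name for name, b in zip(feature_names, row)
                         if abs(b) > tol]
    return selected


# Hypothetical coefficients for two classes and three features.
beta = [[0.8, 0.0, -0.3],
        [0.0, 0.5, 0.0]]
features = ["metabolite_A", "metabolite_B", "metabolite_C"]
classes = ["class_1", "class_2"]
per_class = class_specific_features(beta, features, classes)
print(per_class)
```

A feature that appears under only one class in such a listing would be a candidate for a class-specific marker, subject to the usual validation.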

Citing Articles

GEE-TGDR: A Longitudinal Feature Selection Algorithm and Its Application to lncRNA Expression Profiles for Psoriasis Patients Treated with Immune Therapies.

Tian S, Wang C, Suarez-Farinas M Biomed Res Int. 2021; 2021:8862895.

PMID: 33928163 PMC: 8053058. DOI: 10.1155/2021/8862895.


Feature Selection for Longitudinal Data by Using Sign Averages to Summarize Gene Expression Values over Time.

Tian S, Wang C Biomed Res Int. 2019; 2019:1724898.

PMID: 31016185 PMC: 6444255. DOI: 10.1155/2019/1724898.


The metabolic fingerprints of HCV and HBV infections studied by Nuclear Magnetic Resonance Spectroscopy.

Meoni G, Lorini S, Monti M, Madia F, Corti G, Luchinat C Sci Rep. 2019; 9(1):4128.

PMID: 30858406 PMC: 6412048. DOI: 10.1038/s41598-019-40028-4.


A longitudinal feature selection method identifies relevant genes to distinguish complicated injury and uncomplicated injury over time.

Tian S, Wang C, Chang H BMC Med Inform Decis Mak. 2018; 18(Suppl 5):115.

PMID: 30526581 PMC: 6284265. DOI: 10.1186/s12911-018-0685-8.


To select relevant features for longitudinal gene expression data by extending a pathway analysis method.

Tian S, Wang C, Chang H F1000Res. 2018; 7:1166.

PMID: 30271585 PMC: 6124382. DOI: 10.12688/f1000research.15357.1.

