» Articles » PMID: 28637337

Matrix Completion with Side Information and Its Applications in Predicting the Antigenicity of Influenza Viruses

Overview
Journal Bioinformatics
Specialty Biology
Date 2017 Jun 23
PMID 28637337
Citations 19
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Low-rank matrix completion has been demonstrated to be powerful in predicting antigenic distances among influenza viruses and vaccines from partially revealed hemagglutination inhibition table. Meanwhile, influenza hemagglutinin (HA) protein sequences are also effective in inferring antigenic distances. Thus, it is natural to integrate HA protein sequence information into low-rank matrix completion model to help infer influenza antigenicity, which is critical to influenza vaccine development.

Results: We have proposed a novel algorithm called biological matrix completion with side information (BMCSI), which first measures HA protein sequence similarities among influenza viruses (especially on epitopes) and then integrates the similarity information into a low-rank matrix completion model to predict influenza antigenicity. This algorithm exploits both the correlations among viruses and vaccines in serological tests and the power of HA sequence in predicting influenza antigenicity. We applied this model into H3N2 seasonal influenza virus data. Comparing to previous methods, we significantly reduced the prediction root-mean-square error in a 10-fold cross validation analysis. Based on the cartographies constructed from imputed data, we showed that the antigenic evolution of H3N2 seasonal influenza is generally S-shaped while the genetic evolution is half-circle shaped. We also showed that the Spearman correlation between genetic and antigenic distances (among antigenic clusters) is 0.83, demonstrating a globally high correspondence and some local discrepancies between influenza genetic and antigenic evolution. Finally, we showed that 4.4%±1.2% genetic variance (corresponding to 3.11 ± 1.08 antigenic distances) caused an antigenic drift event for H3N2 influenza viruses historically.

Availability And Implementation: The software and data for this study are available at http://bi.sky.zstu.edu.cn/BMCSI/.

Contact: jialiang.yang@mssm.edu or pinganhe@zstu.edu.cn.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

MetaFluAD: meta-learning for predicting antigenic distances among influenza viruses.

Jia Q, Xia Y, Dong F, Li W Brief Bioinform. 2024; 25(5).

PMID: 39129362 PMC: 11317534. DOI: 10.1093/bib/bbae395.


Emvirus: An embedding-based neural framework for human-virus protein-protein interactions prediction.

Xie P, Zhuang J, Tian G, Yang J Biosaf Health. 2023; 5(3):152-158.

PMID: 37362223 PMC: 10166638. DOI: 10.1016/j.bsheal.2023.04.003.


MNNMDA: Predicting human microbe-disease association via a method to minimize matrix nuclear norm.

Liu H, Bing P, Zhang M, Tian G, Ma J, Li H Comput Struct Biotechnol J. 2023; 21:1414-1423.

PMID: 36824227 PMC: 9941872. DOI: 10.1016/j.csbj.2022.12.053.


Identifying lncRNA-disease association based on GAT multiple-operator aggregation and inductive matrix completion.

Zhang Y, Wang Y, Li X, Liu Y, Chen M Front Genet. 2022; 13:1029300.

PMID: 36338997 PMC: 9631210. DOI: 10.3389/fgene.2022.1029300.


Identifying potential microRNA biomarkers for colon cancer and colorectal cancer through bound nuclear norm regularization.

Zhai S, Li X, Wu Y, Shi X, Ji B, Qiu C Front Genet. 2022; 13:980437.

PMID: 36313468 PMC: 9614659. DOI: 10.3389/fgene.2022.980437.