» Articles » PMID: 39991713

Systematic Collection, Annotation, and Pattern Analysis of Viral Vaccines in the VIOLIN Vaccine Knowledgebase

Overview
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Viral vaccines have been proven significant in protecting us against viral diseases such as COVID-19. To better understand and design viral vaccines, it is critical to systematically collect, annotate, and analyse various viral vaccines and identify enriched patterns from these viral vaccines.

Methods: We systematically collected experimentally verified viral vaccines from the literature, manually annotated, and stored the information in the VIOLIN vaccine database. The annotated information included basic vaccine names, pathogens and diseases, vaccine components, vaccine formulations, and their induced host responses. Enriched patterns were identified from our systematical analysis of the viral vaccines and vaccine antigens.

Results: A total of 2,847 viral vaccines against 95 viral species (including 72 RNA viral species and 23 DNA viral species) were collected, manually annotated, and stored in the VIOLIN vaccine database. These viral vaccines used 542 vaccine antigens. A taxonomical analysis found various DNA and RNA viruses covered by the viral vaccines. These vaccines target different viral life cycle stages (e.g., viral entry, assembly, exit, and immune evasion) as identified in top ranked human, animal vaccines, and HPV vaccines. The vaccine antigen proteins also show up in different virion locations in viruses such as HRSV vaccines. Both structural and non-structural viral proteins have been used for viral vaccine development. Protective vaccine antigens tend to have a protegenicity score of >85% based on the Vaxign-ML calculation, which measures predicted suitability for vaccine use. While predicted adhesins still have significantly higher chances of being protective antigens, only 21.42% of protective viral vaccine antigens were predicted to be adhesins. Furthermore, our Gene Ontology (GO) enrichment analysis using a customized Fisher's exact test identified many enriched patterns such as viral entry into the host cell, DNA/RNA/ATP/ion binding, and suppression of host type 1 interferon-mediated signaling pathway. The viral vaccines and their associated entities and relations are ontologically modeled and represented in the Vaccine Ontology (VO). A VIOLIN web interface was developed to support user friendly queries of viral vaccines.

Discussion: Viral vaccines were systematically collected and annotated in the VIOLIN vaccine knowledgebase, and the analysis of these viral vaccines identified many insightful patterns.

References
1.
Shi L, Sings H, Bryan J, Wang B, Wang Y, Mach H . GARDASIL: prophylactic human papillomavirus vaccine development--from bench top to bed-side. Clin Pharmacol Ther. 2007; 81(2):259-64. DOI: 10.1038/sj.clpt.6100055. View

2.
Shu Y, McCauley J . GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. 2017; 22(13). PMC: 5388101. DOI: 10.2807/1560-7917.ES.2017.22.13.30494. View

3.
Ong E, He Y . Vaccine Design by Reverse Vaccinology and Machine Learning. Methods Mol Biol. 2021; 2414:1-16. DOI: 10.1007/978-1-0716-1900-1_1. View

4.
Amarasinghe G, Ayllon M, Bao Y, Basler C, Bavari S, Blasdell K . Taxonomy of the order Mononegavirales: update 2019. Arch Virol. 2019; 164(7):1967-1980. PMC: 6641539. DOI: 10.1007/s00705-019-04247-4. View

5.
Yang B, Sayers S, Xiang Z, He Y . Protegen: a web-based protective antigen database and analysis system. Nucleic Acids Res. 2010; 39(Database issue):D1073-8. PMC: 3013795. DOI: 10.1093/nar/gkq944. View