» Articles » PMID: 29474353

Deep Sequencing of HBV Pre-S Region Reveals High Heterogeneity of HBV Genotypes and Associations of Word Pattern Frequencies with HCC

Overview
Journal PLoS Genet
Specialty Genetics
Date 2018 Feb 24
PMID 29474353
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Hepatitis B virus (HBV) infection is a common problem in the world, especially in China. More than 60-80% of hepatocellular carcinoma (HCC) cases can be attributed to HBV infection in high HBV prevalent regions. Although traditional Sanger sequencing has been extensively used to investigate HBV sequences, NGS is becoming more commonly used. Further, it is unknown whether word pattern frequencies of HBV reads by Next Generation Sequencing (NGS) can be used to investigate HBV genotypes and predict HCC status. In this study, we used NGS to sequence the pre-S region of the HBV sequence of 94 HCC patients and 45 chronic HBV (CHB) infected individuals. Word pattern frequencies among the sequence data of all individuals were calculated and compared using the Manhattan distance. The individuals were grouped using principal coordinate analysis (PCoA) and hierarchical clustering. Word pattern frequencies were also used to build prediction models for HCC status using both K-nearest neighbors (KNN) and support vector machine (SVM). We showed the extremely high power of analyzing HBV sequences using word patterns. Our key findings include that the first principal coordinate of the PCoA analysis was highly associated with the fraction of genotype B (or C) sequences and the second principal coordinate was significantly associated with the probability of having HCC. Hierarchical clustering first groups the individuals according to their major genotypes followed by their HCC status. Using cross-validation, high area under the receiver operational characteristic curve (AUC) of around 0.88 for KNN and 0.92 for SVM were obtained. In the independent data set of 46 HCC patients and 31 CHB individuals, a good AUC score of 0.77 was obtained using SVM. It was further shown that 3000 reads for each individual can yield stable prediction results for SVM. Thus, another key finding is that word patterns can be used to predict HCC status with high accuracy. Therefore, our study shows clearly that word pattern frequencies of HBV sequences contain much information about the composition of different HBV genotypes and the HCC status of an individual.

Citing Articles

Current status and new directions for hepatocellular carcinoma diagnosis.

Tu J, Wang B, Wang X, Huo K, Hu W, Zhang R Liver Res. 2025; 8(4):218-236.

PMID: 39958920 PMC: 11771281. DOI: 10.1016/j.livres.2024.12.001.


Host and Viral Factors Influencing Chronic Hepatitis B Infection Across Three Generations in a Family.

Naderi M, Hosseini S, Behnampour N, Besharat S, Shahramian I, Khoshnia M Curr Microbiol. 2024; 81(12):446.

PMID: 39499325 DOI: 10.1007/s00284-024-03963-8.


Genotyping Hepatitis B virus by Next-Generation Sequencing: Detection of Mixed Infections and Analysis of Sequence Conservation.

Dopico E, Vila M, Tabernero D, Gregori J, Rando-Segura A, Pacin-Ruiz B Int J Mol Sci. 2024; 25(10).

PMID: 38791519 PMC: 11122360. DOI: 10.3390/ijms25105481.


Sparse logistic regression revealed the associations between HBV PreS quasispecies and hepatocellular carcinoma.

Jia J, Zhang S, Bai X, Fang M, Chen S, Liang X Virol J. 2022; 19(1):114.

PMID: 35765099 PMC: 9238101. DOI: 10.1186/s12985-022-01836-9.


Application of ultrasound combined with enhanced MRI by Gd-BOPTA in diagnosing hepatocellular carcinoma.

Ji S, Wang Z, Xia S Am J Transl Res. 2021; 13(6):7172-7178.

PMID: 34306478 PMC: 8290690.


References
1.
Pollicino T, Cacciola I, Saffioti F, Raimondo G . Hepatitis B virus PreS/S gene variants: pathobiology and clinical implications. J Hepatol. 2014; 61(2):408-17. DOI: 10.1016/j.jhep.2014.04.041. View

2.
Vinga S, Almeida J . Alignment-free sequence comparison-a review. Bioinformatics. 2003; 19(4):513-23. DOI: 10.1093/bioinformatics/btg005. View

3.
Yan Y, Su H, Ji Z, Shao Z, Pu Z . Epidemiology of Hepatitis B Virus Infection in China: Current Status and Challenges. J Clin Transl Hepatol. 2015; 2(1):15-22. PMC: 4521251. DOI: 10.14218/JCTH.2013.00030. View

4.
Tanaka Y, Mukaide M, Orito E, Yuen M, Ito K, Kurbanov F . Specific mutations in enhancer II/core promoter of hepatitis B virus subgenotypes C1/C2 increase the risk of hepatocellular carcinoma. J Hepatol. 2006; 45(5):646-53. DOI: 10.1016/j.jhep.2006.06.018. View

5.
Deng K, Pertea M, Rongvaux A, Wang L, Durand C, Ghiaur G . Broad CTL response is required to clear latent HIV-1 due to dominance of escape mutations. Nature. 2015; 517(7534):381-5. PMC: 4406054. DOI: 10.1038/nature14053. View