
Integrating Internet Multisource Big Data to Predict the Occurrence and Development of COVID-19 Cryptic Transmission

Overview
Journal NPJ Digit Med
Date 2022 Oct 28
PMID 36307547
Abstract

Cryptic transmission of COVID-19 warrants attention and research, and early perception of the risk that it will occur and develop is an important part of controlling the spread of the disease. Previous studies have relied on limited data sources and have not effectively analyzed the occurrence and development of cryptic transmission. We therefore collect multisource Internet big data (retrieval, migration, and media data) and propose comprehensive and relative application strategies to eliminate the impact of national and media data. We use statistical classification and regression to construct early warning models for occurrence and development. Guided by the improved coronavirus herd immunity optimizer (ICHIO), we construct a "sampling-feature-hyperparameter-weight" synchronous optimization strategy. For occurrence warning, we propose an undersampling synchronous evolutionary ensemble (USEE); for development warning, we propose a bootstrap-sampling synchronous evolutionary ensemble (BSEE). On the internal training data (Heilongjiang Province), USEE3 incorporating multisource data achieves a ROC-AUC of 0.9553 and a PR-AUC of 0.8327, and BSEE2 fused by the "nonlinear + linear" method achieves an R of 0.8698. On the external validation data (Shaanxi Province), USEE3 achieves a ROC-AUC of 0.9680 and a PR-AUC of 0.9548, and BSEE2 achieves an R of 0.8255. The method shows good accuracy and generalization and can be flexibly applied to predicting cryptic transmission in various regions. This work integrates multiple early warning tasks based on multisource Internet big data and combines multiple ensemble models. It extends research in traditional infectious disease monitoring and has both practical significance and theoretical value.
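The abstract reports three evaluation metrics: ROC-AUC and PR-AUC for the occurrence (classification) task and R for the development (regression) task. The following is a minimal sketch of how such metrics are commonly computed, not the authors' evaluation code. It assumes that R denotes the Pearson correlation coefficient (the abstract does not define it) and approximates PR-AUC by average precision. The function names and the toy labels and scores are hypothetical and are not data from the study.

```python
from math import sqrt


def roc_auc(labels, scores):
    """Probability that a random positive scores above a random negative.

    Ties between a positive and a negative count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def pr_auc(labels, scores):
    """Average precision: mean precision at the rank of each positive.

    This is a step-wise approximation of the area under the
    precision-recall curve; tied scores are not treated specially.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: -pair[0])
    hits, total = 0, 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank
    return total / hits


def pearson_r(xs, ys):
    """Pearson correlation between observed and predicted values."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: 1 marks a period with cryptic transmission.
labels = [0, 0, 1, 0, 1, 0, 0, 1]
scores = [0.10, 0.40, 0.35, 0.20, 0.80, 0.05, 0.60, 0.70]

print("ROC-AUC:", roc_auc(labels, scores))
print("PR-AUC :", pr_auc(labels, scores))
print("R      :", pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))
```

PR-AUC is reported alongside ROC-AUC because occurrence events are typically rare; precision-recall summaries are more sensitive to performance on the minority class, which is consistent with the abstract's use of undersampling for the occurrence task.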

Citing Articles

Construction of a prediction and visualization system for cognitive impairment in elderly COPD patients based on self-assigning feature weights and residual evolution model.

Cheng W, Yu C, Liu X. Front Artif Intell. 2025; 8:1473223.

PMID: 39991464 PMC: 11842389. DOI: 10.3389/frai.2025.1473223.


Global infectious disease early warning models: An updated review and lessons from the COVID-19 pandemic.

Hu W, Sun H, Wei Y, Hao Y. Infect Dis Model. 2025; 10(2):410-422.

PMID: 39816751 PMC: 11731462. DOI: 10.1016/j.idm.2024.12.001.


Internet-based Surveillance Systems and Infectious Diseases Prediction: An Updated Review of the Last 10 Years and Lessons from the COVID-19 Pandemic.

McClymont H, Lambert S, Barr I, Vardoulakis S, Bambrick H, Hu W. J Epidemiol Glob Health. 2024; 14(3):645-657.

PMID: 39141074 PMC: 11442909. DOI: 10.1007/s44197-024-00272-y.


Identification of an immune-related eRNA prognostic signature for clear cell renal cell carcinoma.

Lv Y, Niu L, Li Q, Shao W, Yan X, Li Y. Aging (Albany NY). 2024; 16(3):2232-2248.

PMID: 38289619 PMC: 10911372. DOI: 10.18632/aging.205479.


Deep evolutionary fusion neural network: a new prediction standard for infectious disease incidence rates.

Yao T, Chen X, Wang H, Gao C, Chen J, Yi D. BMC Bioinformatics. 2024; 25(1):38.

PMID: 38262917 PMC: 10804580. DOI: 10.1186/s12859-023-05621-5.

