» Articles » PMID: 35861678

Developing a Long COVID Phenotype for Postacute COVID-19 in a National Primary Care Sentinel Cohort: Observational Retrospective Database Analysis

Abstract

Background: Following COVID-19, up to 40% of people have ongoing health problems, referred to as postacute COVID-19 or long COVID (LC). LC varies from a single persisting symptom to a complex multisystem disease. Research has flagged that this condition is underrecorded in primary care records, and seeks to better define its clinical characteristics and management. Phenotypes provide a standard method for case definition and identification from routine data and are usually machine-processable. An LC phenotype can underpin research into this condition.

Objective: This study aims to develop a phenotype for LC to inform the epidemiology and future research into this condition. We compared clinical symptoms in people with LC before and after their index infection, recorded from March 1, 2020, to April 1, 2021. We also compared people recorded as having acute infection with those with LC who were hospitalized and those who were not.

Methods: We used data from the Primary Care Sentinel Cohort (PCSC) of the Oxford Royal College of General Practitioners (RCGP) Research and Surveillance Centre (RSC) database. This network was recruited to be nationally representative of the English population. We developed an LC phenotype using our established 3-step ontological method: (1) ontological step (defining the reasoning process underpinning the phenotype, (2) coding step (exploring what clinical terms are available, and (3) logical extract model (testing performance). We created a version of this phenotype using Protégé in the ontology web language for BioPortal and using PhenoFlow. Next, we used the phenotype to compare people with LC (1) with regard to their symptoms in the year prior to acquiring COVID-19 and (2) with people with acute COVID-19. We also compared hospitalized people with LC with those not hospitalized. We compared sociodemographic details, comorbidities, and Office of National Statistics-defined LC symptoms between groups. We used descriptive statistics and logistic regression.

Results: The long-COVID phenotype differentiated people hospitalized with LC from people who were not and where no index infection was identified. The PCSC (N=7.4 million) includes 428,479 patients with acute COVID-19 diagnosis confirmed by a laboratory test and 10,772 patients with clinically diagnosed COVID-19. A total of 7471 (1.74%, 95% CI 1.70-1.78) people were coded as having LC, 1009 (13.5%, 95% CI 12.7-14.3) had a hospital admission related to acute COVID-19, and 6462 (86.5%, 95% CI 85.7-87.3) were not hospitalized, of whom 2728 (42.2%) had no COVID-19 index date recorded. In addition, 1009 (13.5%, 95% CI 12.73-14.28) people with LC were hospitalized compared to 17,993 (4.5%, 95% CI 4.48-4.61; P<.001) with uncomplicated COVID-19.

Conclusions: Our LC phenotype enables the identification of individuals with the condition in routine data sets, facilitating their comparison with unaffected people through retrospective research. This phenotype and study protocol to explore its face validity contributes to a better understanding of LC.

Citing Articles

Enhancing long COVID care in general practice: A qualitative study.

Broughan J, Sietins E, Siu K, Clendennen N, Collins C, Fawsitt R PLoS One. 2024; 19(6):e0306077.

PMID: 38924005 PMC: 11207167. DOI: 10.1371/journal.pone.0306077.


Phenotype execution and modeling architecture to support disease surveillance and real-world evidence studies: English sentinel network evaluation.

Jamie G, Elson W, Kar D, Wimalaratna R, Hoang U, Meza-Torres B JAMIA Open. 2024; 7(2):ooae034.

PMID: 38737141 PMC: 11087727. DOI: 10.1093/jamiaopen/ooae034.


Impact of Pre-Infection COVID-19 Vaccination on the Incidence and Severity of Post-COVID Syndrome: A Systematic Review and Meta-Analysis.

Man M, Rosca D, Bratosin F, Fira-Mladinescu O, Ilie A, Burtic S Vaccines (Basel). 2024; 12(2).

PMID: 38400172 PMC: 10893048. DOI: 10.3390/vaccines12020189.


Computable Phenotypes for Post-acute sequelae of SARS-CoV-2: A National COVID Cohort Collaborative Analysis.

Pungitore S, Olorunnisola T, Mosier J, Subbian V AMIA Annu Symp Proc. 2024; 2023:589-598.

PMID: 38222385 PMC: 10785914.


Advancing the Management of Long COVID by Integrating into Health Informatics Domain: Current and Future Perspectives.

Ambalavanan R, Snead R, Marczika J, Kozinsky K, Aman E Int J Environ Res Public Health. 2023; 20(19).

PMID: 37835106 PMC: 10572294. DOI: 10.3390/ijerph20196836.


References
1.
Espinosa-Gonzalez A, Neves A, Fiorentino F, Prociuk D, Husain L, Ramtale S . Predicting Risk of Hospital Admission in Patients With Suspected COVID-19 in a Community Setting: Protocol for Development and Validation of a Multivariate Risk Prediction Tool. JMIR Res Protoc. 2021; 10(5):e29072. PMC: 8153031. DOI: 10.2196/29072. View

2.
Sudre C, Murray B, Varsavsky T, Graham M, Penfold R, Bowyer R . Attributes and predictors of long COVID. Nat Med. 2021; 27(4):626-631. PMC: 7611399. DOI: 10.1038/s41591-021-01292-y. View

3.
Musen M . The Protégé Project: A Look Back and a Look Forward. AI Matters. 2016; 1(4):4-12. PMC: 4883684. DOI: 10.1145/2757001.2757003. View

4.
Papez V, Denaxas S, Hemingway H . Evaluation of Semantic Web Technologies for Storing Computable Definitions of Electronic Health Records Phenotyping Algorithms. AMIA Annu Symp Proc. 2018; 2017:1352-1361. PMC: 5977586. View

5.
Brat G, Weber G, Gehlenborg N, Avillach P, Palmer N, Chiovato L . International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium. NPJ Digit Med. 2020; 3:109. PMC: 7438496. DOI: 10.1038/s41746-020-00308-0. View