» Articles » PMID: 38853961

The Polygenic Score Catalog: New Functionality and Tools to Enable FAIR Research

Abstract

Polygenic scores (PGS) have transformed human genetic research and have multiple potential clinical applications, including risk stratification for disease prevention and prediction of treatment response. Here, we present a series of recent enhancements to the PGS Catalog (www.PGSCatalog.org), the largest findable, accessible, interoperable, and reusable (FAIR) repository of PGS. These include expansions in data content and ancestral diversity as well as the addition of new features. We further present the PGS Catalog Calculator (pgsc_calc, https://github.com/PGScatalog/pgsc_calc), an open-source, scalable and portable pipeline to reproducibly calculate PGS that securely democratizes equitable PGS applications by implementing genetic ancestry estimation and score normalization using reference data. With the PGS Catalog & calculator users can now quantify an individual's genetic predisposition for hundreds of common diseases and clinically relevant traits. Taken together, these updates and tools facilitate the next generation of PGS, thus lowering barriers to the clinical studies necessary to identify where PGS may be integrated into clinical practice.

References
1.
Ruan Y, Lin Y, Feng Y, Chen C, Lam M, Guo Z . Improving polygenic prediction in ancestrally diverse populations. Nat Genet. 2022; 54(5):573-580. PMC: 9117455. DOI: 10.1038/s41588-022-01054-7. View

2.
Lambert S, Gil L, Jupp S, Ritchie S, Xu Y, Buniello A . The Polygenic Score Catalog as an open database for reproducibility and systematic evaluation. Nat Genet. 2021; 53(4):420-425. PMC: 11165303. DOI: 10.1038/s41588-021-00783-5. View

3.
Zhang D, Dey R, Lee S . Fast and robust ancestry prediction using principal component analysis. Bioinformatics. 2020; 36(11):3439-3446. PMC: 7267814. DOI: 10.1093/bioinformatics/btaa152. View

4.
Chang C, Chow C, Tellier L, Vattikuti S, Purcell S, Lee J . Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015; 4:7. PMC: 4342193. DOI: 10.1186/s13742-015-0047-8. View

5.
Prive F, Arbel J, Vilhjalmsson B . LDpred2: better, faster, stronger. Bioinformatics. 2020; 36(22-23):5424-5431. PMC: 8016455. DOI: 10.1093/bioinformatics/btaa1029. View