» Articles » PMID: 32693798

Semi-Parallel Logistic Regression for GWAS on Encrypted Data

Overview
Publisher Biomed Central
Specialty Genetics
Date 2020 Jul 23
PMID 32693798
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The sharing of biomedical data is crucial to enable scientific discoveries across institutions and improve health care. For example, genome-wide association studies (GWAS) based on a large number of samples can identify disease-causing genetic variants. The privacy concern, however, has become a major hurdle for data management and utilization. Homomorphic encryption is one of the most powerful cryptographic primitives which can address the privacy and security issues. It supports the computation on encrypted data, so that we can aggregate data and perform an arbitrary computation on an untrusted cloud environment without the leakage of sensitive information.

Methods: This paper presents a secure outsourcing solution to assess logistic regression models for quantitative traits to test their associations with genotypes. We adapt the semi-parallel training method by Sikorska et al., which builds a logistic regression model for covariates, followed by one-step parallelizable regressions on all individual single nucleotide polymorphisms (SNPs). In addition, we modify our underlying approximate homomorphic encryption scheme for performance improvement.

Results: We evaluated the performance of our solution through experiments on real-world dataset. It achieves the best performance of homomorphic encryption system for GWAS analysis in terms of both complexity and accuracy. For example, given a dataset consisting of 245 samples, each of which has 10643 SNPs and 3 covariates, our algorithm takes about 43 seconds to perform logistic regression based genome wide association analysis over encryption.

Conclusions: We demonstrate the feasibility and scalability of our solution.

Citing Articles

Secure and scalable gene expression quantification with pQuant.

Hong S, Walker C, Choi Y, Gursoy G Nat Commun. 2025; 16(1):2380.

PMID: 40064866 PMC: 11894182. DOI: 10.1038/s41467-025-57393-6.


Private detection of relatives in forensic genomics using homomorphic encryption.

de Souza F, de Lassus H, Cammarota R BMC Med Genomics. 2024; 17(1):273.

PMID: 39563334 PMC: 11575431. DOI: 10.1186/s12920-024-02037-9.


Genomic privacy preservation in genome-wide association studies: taxonomy, limitations, challenges, and vision.

Aherrahrou N, Tairi H, Aherrahrou Z Brief Bioinform. 2024; 25(5.

PMID: 39073827 PMC: 11285165. DOI: 10.1093/bib/bbae356.


Privacy-preserving model evaluation for logistic and linear regression using homomorphically encrypted genotype data.

Hong S, Choi Y, Joo D, Gursoy G J Biomed Inform. 2024; 156:104678.

PMID: 38936565 PMC: 11272436. DOI: 10.1016/j.jbi.2024.104678.


SVAT: Secure outsourcing of variant annotation and genotype aggregation.

Kim M, Wang S, Jiang X, Harmanci A BMC Bioinformatics. 2022; 23(1):409.

PMID: 36182914 PMC: 9526274. DOI: 10.1186/s12859-022-04959-6.


References
1.
Hug C, Szolovits P . ICU acuity: real-time models versus daily models. AMIA Annu Symp Proc. 2010; 2009:260-4. PMC: 2815497. View

2.
Bonte C, Makri E, ArdeshirDavani A, Simm J, Moreau Y, Vercauteren F . Towards practical privacy-preserving genome-wide association study. BMC Bioinformatics. 2018; 19(1):537. PMC: 6302495. DOI: 10.1186/s12859-018-2541-3. View

3.
Sikorska K, Lesaffre E, Groenen P, Eilers P . GWAS on your notebook: fast semi-parallel linear and logistic regression for genome-wide association studies. BMC Bioinformatics. 2013; 14:166. PMC: 3695771. DOI: 10.1186/1471-2105-14-166. View

4.
Kim M, Lauter K . Private genome analysis through homomorphic encryption. BMC Med Inform Decis Mak. 2016; 15 Suppl 5:S3. PMC: 4699052. DOI: 10.1186/1472-6947-15-S5-S3. View

5.
Kim A, Song Y, Kim M, Lee K, Cheon J . Logistic regression model training based on the approximate homomorphic encryption. BMC Med Genomics. 2018; 11(Suppl 4):83. PMC: 6180367. DOI: 10.1186/s12920-018-0401-7. View