Abstract
Polygenic risk scores (PRSs) quantify an individual's genetic susceptibility to complex traits and diseases. Conventional PRSs, which are based on linear models, perform poorly for phenotypes with skewed distributions or with genetic effects that vary across the distribution. We propose a quantile regression-based PRS (QPRS) that can capture quantile-specific genetic effects. While existing PRSs provide only a single score, QPRS models genetic influences at multiple quantiles of the phenotype, thereby enhancing predictive performance by utilizing these multiple scores as covariates. We evaluate the performance of our method through both simulations and a real-data application. In simulations, QPRS significantly reduces the mean squared error (MSE) compared to the conventional PRS, both in the presence of variance quantitative trait loci and outliers. For real data analysis, we use data from Korea Genome and Epidemiology Study (KoGES) to evaluate predictive performance. We consider two prediction tasks: a continuous outcome (glucose level) and a binary outcome (diabetes status, derived from glucose level). For glucose-level prediction, the model incorporating QPRS achieves a R2 value 4.69 times higher than the model using conventional PRSs. For predicting diabetes status, the model with QPRS produces an area under the curve 1.06 times higher than the model with conventional PRSs.
Links & Resources
Authors
Cite This Paper
S., K., T., G., T., P., M., P. (2025). Enhancing Polygenic Risk Prediction by Modeling Quantile-Specific Genetic Effects. arXiv preprint arXiv:10.64898/2025.12.25.25342935.
Kim, S., Goo, T., Park, T., and Park, M.. "Enhancing Polygenic Risk Prediction by Modeling Quantile-Specific Genetic Effects." arXiv preprint arXiv:10.64898/2025.12.25.25342935 (2025).