Research Article

The Relationship Between Polygenic Risk Scores and Clinical Phenotype in Patients with Phenylketonuria: Genetic Prediction with the Random Forest Model

Volume: 35 Number: 4 August 29, 2025
EN TR

The Relationship Between Polygenic Risk Scores and Clinical Phenotype in Patients with Phenylketonuria: Genetic Prediction with the Random Forest Model

Abstract

Summary: Background /Aims: Pathogenic variations in the PAH gene cause phenylketonuria (PKU), a monogenic metabolic disorder. Individuals with the same mutation often exhibit phenotypic variability despite the monogenic nature of the condition. The aim of this study is to create a prediction model using the Random Forest (RF) machine learning algorithm and to examine the relationship between polygenic risk scores (PRS) and phenotypic severity in PKU patients. Methods: In this study, clinical exome sequencing data obtained from 174 PKU patients with molecular validation were retrospectively examined. Approximately 18,000 common variants were retained after being filtered by population frequency and quality for individual-level analysis. All eligible variants (excluding PAH mutations) were used to calculate PRS, and RF (1000 trees, maximum depth = 5) was used for modeling. International criteria were used to classify patients into mild, moderate, and severe phenotypes. Pearson correlation and ROC analysis were used to evaluate the model's performance. Findings: The RF-based PRS model had a high accuracy rate in predicting phenotypic severity (AUC = 0.91, overall accuracy = 84.3%). There was a significant correlation between PRS values and the severity of the phenotype (r = 0.68, p < 0.001). Severe clinical phenotypes were more common in patients with higher PRS. Variants in genes associated with phenylalanine metabolism (e.g., GCH1, QDPR, PTS) were the most significant contributors to risk prediction according to feature importance analysis results. Results: The results indicate that PRS modeling combined with machine learning could be a useful method for predicting the severity of phenotypes in monogenic disorders such as PKU. This integrative approach highlights the regulatory effect of a polygenic background and suggests that PRS could support clinical risk assessment and personalized treatment plans. However, before clinical application, it is very important to validate in various populations.

Keywords

Phenylketonuria , polygenic risk score , genomic modeling , machine learning , phenotypic prediction

References

  1. 1. Berga-Švītiņa E, Miklaševičs E, Fischer K, Vilne B, Mägi R. Polygenic risk score predicts modified risk in BRCA1 pathogenic variant carriers in breast cancer patients. Cancers (Basel). 2023;15(11):2957. https://doi.org/10.3390/cancers15112957
  2. 2. Blau N, Longo N, van Spronsen FJ. PKU: Current management and future developments. Mol Genet Metab. 2021;132(1):1–12. https://doi.org/10.1016/j.ymgme.2021.04.003
  3. 3. Christoffersen M, Tybjærg‐Hansen A. Polygenic risk scores: How much do they add? Curr Opin Lipidol. 2021;32(3):157–62. https://doi.org/10.1097/mol.0000000000000759
  4. 4. De Vincentis A, Tavaglione F, Jamialahmadi O, et al. A polygenic risk score to refine risk stratification and prediction for severe liver disease by clinical fibrosis scores. Clin Gastroenterol Hepatol. 2022;20(3):658–73.e6. https://doi.org/10.1016/j.cgh.2021.05.056
  5. 5. Fahed AC, Wang M, Homburger JR, et al. Polygenic background modifies penetrance of monogenic variants for tier 1 genomic conditions. Nat Commun. 2020;11(1):3635. https://doi.org/10.1038/s41467-020-17374-3
  6. 6. Fang Y, Gao J, Guo Y, Li X, Yuan E, Zhang L. Allelic phenotype prediction of phenylketonuria based on the machine learning method. Hum Genomics. 2023;17(1). https://doi.org/10.1186/s40246-023-00481-9
  7. 7. Goodrich J, Singer‐Berk M, Son R, et al. Determinants of penetrance and variable expressivity in monogenic metabolic conditions across 77,184 exomes. Nat Commun. 2021;12(1). https://doi.org/10.1038/s41467-021-23556-4
  8. 8. Groenendyk JW, Ahmed ST, Fanidi A, et al. Implementation of population-based polygenic risk scores: A conference at the Pritchard Lab. BMC Med Genomics. 2021;14(1):91. https://doi.org/10.1186/s12920-021-00942-1
  9. 9. Groenendyk JW, Greenland P, Khan SS. Incremental value of polygenic risk scores in primary prevention of coronary heart disease. JAMA Intern Med. 2022;182(10):1082–9. https://doi.org/10.1001/jamainternmed.2022.3171
  10. 10. Honda S, Ikari K, Yano K, et al. Association of polygenic risk scores with radiographic progression in patients with rheumatoid arthritis. Arthritis Rheumatol. 2022;74(5):791–800. https://doi.org/10.1002/art.42051
Vancouver
1.Ebru Marzioğlu Özdemir, Ozkan Bagci. The Relationship Between Polygenic Risk Scores and Clinical Phenotype in Patients with Phenylketonuria: Genetic Prediction with the Random Forest Model. Genel Tıp Derg. 2025 Aug. 1;35(4):728-35. doi:10.54005/geneltip.1697947