EN
TR
Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach
Abstract
In large-scale observational datasets, associations between clinical outcomes and biomarkers are highly sensitive to confounding control and statistical modeling strategies. This study aimed to evaluate how confounding-control and variable-selection strategies influence the observed association between retinopathy and liver fibrosis in a large observational dataset, thereby providing a methodological demonstration of how statistical modeling decisions may influence epidemiological inference. Data were obtained from the National Health and Nutrition Examination Survey (NHANES) 2005-2008 cycles. Liver fibrosis was defined using the FIB-4 index and classified as no/mild fibrosis (FIB-4 < 1.45) or significant fibrosis/cirrhosis (FIB-4 ≥ 1.45). Retinopathy status and severity were determined using standardized retinal imaging data. To improve baseline comparability between fibrosis groups, propensity score matching (PSM) was implemented based on age and sex, and covariate balance was assessed using standardized mean differences. Variable selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) with 10-fold cross-validation, and the optimal penalty parameter (λ) was selected based on the minimum cross-validated error (λmin), followed by multivariable logistic regression modeling. Variables used in the calculation of the FIB-4 index were excluded from regression analyses to avoid circular inference. A total of 5,364 participants were included. Before matching, substantial imbalance was observed between fibrosis groups, particularly for age and sex. After 1:1 propensity score matching, adequate covariate balance was achieved. LASSO-based variable selection identified hepatitis C virus infection, body mass index, height, race, and retinopathy as candidate predictors of liver fibrosis. In the final multivariable logistic regression model, hepatitis C virus infection showed the strongest association with significant liver fibrosis (OR = 2.70, 95% CI: 1.66-4.39), while body mass index and height demonstrated modest but statistically significant associations. Retinopathy was not independently associated with liver fibrosis after multivariable adjustment. The results demonstrate that the apparent association between retinopathy and liver fibrosis in observational data is highly dependent on statistical modeling choices and confounding-control strategies. Rather than supporting a direct clinical relationship, the findings emphasize how analytical design and variable selection methods can substantially shape conclusions derived from large-scale health datasets. From a methodological perspective, this study illustrates how confounding-control and variable-selection strategies can alter epidemiological inference in observational research. This study highlights the importance of transparent and rigorously justified statistical modeling frameworks in applied data-driven research.
Keywords
Ethical Statement
Ethics committee approval was not required for this study because it was conducted using publicly available, de-identified secondary data. The data were obtained from the National Health and Nutrition Examination Survey (NHANES), which is conducted in accordance with ethical standards and approved by the National Center for Health Statistics Research Ethics Review Board. No identifiable human data were used in this analysis.
References
- Asero, C., Giandalia, A., Cacciola, I., Morace, C., Lorello, G., Caspanello, A. R., Alibrandi, A., Squadrito, G., & Russo, G. T. (2023). High Prevalence of Severe Hepatic Fibrosis in Type 2 Diabetic Outpatients Screened for Non-Alcoholic Fatty Liver Disease. J Clin Med, 12(8). https://doi.org/10.3390/jcm12082858
- Austin, P. C. (2011). An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate behavioral research, 46(3), 399-424.
- Bangchang, K. N. (2024). Application of Bayesian variable selection in logistic regression model. AIMS Mathematics, 9(5), 13336-13345. https://doi.org/10.3934/math.2024650
- Bo, J., & Yang, M. (2025). Dose-response relationship between physical activity and visceral fat mass: a cross-sectional study based on NHANES 2011-2018. BMC Public Health, 25(1), 3113. https://doi.org/10.1186/s12889-025-24393-6
- Burgos-Ochoa, L., & Clouth, F. J. (2025). Causal Inference and Survey Data in Paediatric Epidemiology: Generalising Treatment Effects From Observational Data. Paediatr Perinat Epidemiol. https://doi.org/10.1111/ppe.70042
- Cheang, I., Zhu, X., Zhu, Q., Li, M., Liao, S., Zuo, Z., Yao, W., Zhou, Y., Zhang, H., & Li, X. (2022). Inverse association between blood ethylene oxide levels and obesity in the general population: NHANES 2013-2016. Front Endocrinol (Lausanne), 13, 926971. https://doi.org/10.3389/fendo.2022.926971
- Chhabra, S., Singh, S. P., Singh, A., Mehta, V., Kaur, A., Bansal, N., & Sood, A. (2022). Diabetes Mellitus Increases the Risk of Significant Hepatic Fibrosis in Patients With Non-alcoholic Fatty Liver Disease. J Clin Exp Hepatol, 12(2), 409-416. https://doi.org/10.1016/j.jceh.2021.07.001
- Deravi, N., Dehghani Firouzabadi, F., Moosaie, F., Asadigandomani, H., Arab Bafrani, M., Yoosefi, N., Poopak, A., Dehghani Firouzabadi, M., Poudineh, M., Rabizadeh, S., Kamel, I., Nakhjavani, M., & Esteghamati, A. (2023). Non-alcoholic fatty liver disease and incidence of microvascular complications of diabetes in patients with type 2 diabetes: a prospective cohort study. Front Endocrinol (Lausanne), 14, 1147458. https://doi.org/10.3389/fendo.2023.1147458
Details
Primary Language
English
Subjects
Biostatistics
Journal Section
Research Article
Publication Date
May 15, 2026
Submission Date
February 3, 2026
Acceptance Date
April 25, 2026
Published in Issue
Year 2026 Volume: 9 Number: 3
APA
Cantaş Türkiş, F., & Varol, B. (2026). Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach. Black Sea Journal of Engineering and Science, 9(3), 1301-1311. https://doi.org/10.34248/bsengineering.1880946
AMA
1.Cantaş Türkiş F, Varol B. Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach. BSJ Eng. Sci. 2026;9(3):1301-1311. doi:10.34248/bsengineering.1880946
Chicago
Cantaş Türkiş, Fulden, and Buğra Varol. 2026. “Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach”. Black Sea Journal of Engineering and Science 9 (3): 1301-11. https://doi.org/10.34248/bsengineering.1880946.
EndNote
Cantaş Türkiş F, Varol B (May 1, 2026) Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach. Black Sea Journal of Engineering and Science 9 3 1301–1311.
IEEE
[1]F. Cantaş Türkiş and B. Varol, “Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach”, BSJ Eng. Sci., vol. 9, no. 3, pp. 1301–1311, May 2026, doi: 10.34248/bsengineering.1880946.
ISNAD
Cantaş Türkiş, Fulden - Varol, Buğra. “Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach”. Black Sea Journal of Engineering and Science 9/3 (May 1, 2026): 1301-1311. https://doi.org/10.34248/bsengineering.1880946.
JAMA
1.Cantaş Türkiş F, Varol B. Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach. BSJ Eng. Sci. 2026;9:1301–1311.
MLA
Cantaş Türkiş, Fulden, and Buğra Varol. “Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach”. Black Sea Journal of Engineering and Science, vol. 9, no. 3, May 2026, pp. 1301-1, doi:10.34248/bsengineering.1880946.
Vancouver
1.Fulden Cantaş Türkiş, Buğra Varol. Influence of Confounding Control on Liver Fibrosis Modeling: A Propensity Score Matching and Lasso-Based Logistic Regression Approach. BSJ Eng. Sci. 2026 May 1;9(3):1301-1. doi:10.34248/bsengineering.1880946