SCADA-Based AC Power Forecasting in a Utility-Scale PV Plant: Explainable HistGradientBoosting and Thermal Derating Quantification
Abstract
Accurate photovoltaic (PV) power modeling is essential for grid operation, short-term scheduling and plant-level decision making. However, reported performance in the literature varies widely due to differences in feature selection, forecast horizon definition and evaluation protocols. In particular, true exogenous forecasting is often conflated with contemporaneous power tracking, leading to optimistic accuracy estimates with limited operational relevance. This study aims to clarify how feature availability, evaluation design and operational intent shape reported performance in PV power modeling. A leakage-aware and operationally realistic framework is proposed that explicitly distinguishes between exogenous forecasting and operational tracking tasks. Using high-frequency SCADA data from a utility-scale PV plant, two modeling scenarios are evaluated. Scenario A relies exclusively on exogenous meteorological inputs and represents a forecasting-oriented inference task, whereas Scenario B augments these inputs with synchronous electrical current measurements and is interpreted as operational tracking (nowcasting) rather than forward-looking prediction. Multiple regression models, with emphasis on tree-based ensemble methods, are evaluated under transparent and scenario-consistent train–test splitting strategies. Model behavior is examined using post-hoc explainability tools to assess physical consistency and temperature-induced power derating is quantified using a semiparametric regression approach. The results indicate that exogenous-only models recover physically meaningful relationships but exhibit higher numerical errors than current-augmented configurations, reflecting the informational advantage of synchronous measurements rather than improved forecasting capability. Rather than claiming superior numerical accuracy, this work demonstrates how modeling assumptions and evaluation choices directly influence reported performance and its interpretation. By aligning feature availability with operational intent, the proposed framework provides practical guidance for the development, evaluation and deployment of PV power models in real-world applications.
Keywords
Supporting Institution
Ethical Statement
Thanks
References
- Alcañiz, A., Grzebyk, D., Ziar, H., & Isabella, O. (2023). Trends and gaps in photovoltaic power forecasting with machine learning. Energy Reports, 9, 447–471. https://doi.org/10.1016/J.EGYR.2022.11.208
- Antonanzas, J., Osorio, N., Escobar, R., Urraca, R., Martinez-de-Pison, F. J., & Antonanzas-Torres, F. (2016). Review of photovoltaic power forecasting. Solar Energy, 136, 78–111. https://doi.org/10.1016/J.SOLENER.2016.06.069
- Barhmi, K., Heynen, C., Golroodbari, S., & van Sark, W. (2024). A Review of Solar Forecasting Techniques and the Role of Artificial Intelligence. Solar 2024, Vol. 4, Pages 99-135, 4(1), 99–135. https://doi.org/10.3390/SOLAR4010005
- Bergmeir, C., Hyndman, R. J., & Koo, B. (2018). A note on the validity of cross-validation for evaluating autoregressive time series prediction. Computational Statistics & Data Analysis, 120, 70–83. https://doi.org/10.1016/J.CSDA.2017.11.003
- Böök, H., & Lindfors, A. V. (2020). Site-specific adjustment of a NWP-based photovoltaic production forecast. Solar Energy, 211, 779–788. https://doi.org/10.1016/J.SOLENER.2020.10.024
- Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (2017). Classification and regression trees. Classification and Regression Trees, 1–358. https://doi.org/10.1201/9781315139470
- Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 13-17-Augu, 785–794. https://doi.org/10.1145/2939672.2939785
- Di Leo, P., Ciocia, A., Malgaroli, G., & Spertino, F. (2025). Advancements and Challenges in Photovoltaic Power Forecasting: A Comprehensive Review. Energies 2025, Vol. 18, Page 2108, 18(8), 2108. https://doi.org/10.3390/EN18082108
Details
Primary Language
English
Subjects
Deep Learning, Photovoltaic Power Systems
Journal Section
Research Article
Authors
Hasan Uzel
*
0000-0002-8238-2588
Türkiye
Publication Date
March 31, 2026
Submission Date
November 6, 2025
Acceptance Date
February 17, 2026
Published in Issue
Year 2026 Volume: 13 Number: 1