Accurate photovoltaic (PV) power modeling is essential for grid operation, short-term scheduling and plant-level decision making. However, reported performance in the literature varies widely due to differences in feature selection, forecast horizon definition and evaluation protocols. In particular, true exogenous forecasting is often conflated with contemporaneous power tracking, leading to optimistic accuracy estimates with limited operational relevance. This study aims to clarify how feature availability, evaluation design and operational intent shape reported performance in PV power modeling. A leakage-aware and operationally realistic framework is proposed that explicitly distinguishes between exogenous forecasting and operational tracking tasks. Using high-frequency SCADA data from a utility-scale PV plant, two modeling scenarios are evaluated. Scenario A relies exclusively on exogenous meteorological inputs and represents a forecasting-oriented inference task, whereas Scenario B augments these inputs with synchronous electrical current measurements and is interpreted as operational tracking (nowcasting) rather than forward-looking prediction. Multiple regression models, with emphasis on tree-based ensemble methods, are evaluated under transparent and scenario-consistent train–test splitting strategies. Model behavior is examined using post-hoc explainability tools to assess physical consistency and temperature-induced power derating is quantified using a semiparametric regression approach. The results indicate that exogenous-only models recover physically meaningful relationships but exhibit higher numerical errors than current-augmented configurations, reflecting the informational advantage of synchronous measurements rather than improved forecasting capability. Rather than claiming superior numerical accuracy, this work demonstrates how modeling assumptions and evaluation choices directly influence reported performance and its interpretation. By aligning feature availability with operational intent, the proposed framework provides practical guidance for the development, evaluation and deployment of PV power models in real-world applications.
This study does not involve human participants, animals, or sensitive data, and therefore does not require ethical approval. The research was conducted using publicly available or institutionally collected SCADA data.
No financial or institutional support was received for this study.
The author gratefully acknowledges the support and encouragement of his family during the preparation of this manuscript.
| Primary Language | English |
|---|---|
| Subjects | Deep Learning, Photovoltaic Power Systems |
| Journal Section | Research Article |
| Authors | |
| Submission Date | November 6, 2025 |
| Acceptance Date | February 17, 2026 |
| Publication Date | March 31, 2026 |
| DOI | https://doi.org/10.54287/gujsa.1818552 |
| IZ | https://izlik.org/JA52DM97KL |
| Published in Issue | Year 2026 Volume: 13 Issue: 1 |