As solar energy adoption continues to rise, the demand for reliable photovoltaic (PV) systems has increased significantly. Efficient and secure operation of PV systems depends on accurate fault detection, which makes fault diagnosis a critical research area. This study investigates the diagnosis of short-circuit faults in PV systems by integrating machine learning algorithms with data balancing techniques. Four classifiers were employed for fault classification: Random Forest, CatBoost, Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LGBM). Three oversampling techniques were used to address class imbalance: Synthetic Minority Oversampling Technique (SMOTE), Random Oversampling, and Adaptive Synthetic Sampling (ADASYN). Two datasets were analyzed: Dataset-1 with 11 features and Dataset-2 with 13 features. For Dataset-1, LGBM achieved the highest accuracy on the imbalanced data (79.28%), which improved to 86.59% after applying SMOTE. With the two additional features in Dataset-2, fault diagnosis accuracy increased to 98.57% on the imbalanced data and reached 100% when the data were balanced with SMOTE. These findings demonstrate that combining LGBM with SMOTE significantly enhances short-circuit fault detection performance in PV systems.
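
To make the balancing step concrete, the following is a minimal pure-Python sketch of the core SMOTE idea: each synthetic minority sample is an interpolation between a minority sample and one of its nearest minority-class neighbours. It is an illustration only and is not the authors' implementation. The function names `smote` and `balance`, the two-class setup, the default `k=5`, and the toy data are all assumptions made for this example; the study's datasets and preprocessing are not reproduced here.

```python
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic points by interpolating between minority samples."""
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    if k < 1:
        raise ValueError("SMOTE needs at least two minority samples")
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of `base` (squared Euclidean distance)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        mate = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> mate
        synthetic.append([a + gap * (b - a) for a, b in zip(base, mate)])
    return synthetic


def balance(X, y, minority_label, k=5, seed=0):
    """Oversample one minority class until it matches the rest (two-class assumption)."""
    minority = [x for x, label in zip(X, y) if label == minority_label]
    n_majority = len(y) - len(minority)
    new_points = smote(minority, n_majority - len(minority), k=k, seed=seed)
    return X + new_points, y + [minority_label] * len(new_points)


# Hypothetical toy data: six "normal" samples (label 0) and two "fault" samples (label 1)
X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [0.1, 0.0], [0.2, 0.3],
     [5.0, 5.0], [5.5, 5.2]]
y = [0, 0, 0, 0, 0, 0, 1, 1]

X_bal, y_bal = balance(X, y, minority_label=1)
print(y_bal.count(0), y_bal.count(1))
```

After balancing, both classes have the same number of samples, and every synthetic point lies on a line segment between two original minority samples. A classifier such as LGBM would then be trained on the balanced training split only, with the test split left untouched so that reported accuracy reflects the original class distribution.
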
| Primary Language | English |
|---|---|
| Subjects | Photovoltaic Power Systems |
| Journal Section | Research Article |
| Authors | |
| Submission Date | September 6, 2025 |
| Acceptance Date | September 22, 2025 |
| Publication Date | October 30, 2025 |
| DOI | https://doi.org/10.5152/tepes.2025.25038 |
| IZ | https://izlik.org/JA89BX27NW |
| Published in Issue | Year 2025 Volume: 5 Issue: 3 |