This paper proposes an artificial intelligence-based solution to classify the three basic intrinsic prediction modes (Planar, DC and Angular) defined in the High Efficiency Video Coding (HEVC) standard. A convolutional neural network (CNN) based deep learning model trained with 32×32 blocks obtained from 30+ classical gray level test images is developed. As a result of the training, the model demonstrated a successful classification performance with an overall accuracy of over 89% and a macro F1 score of approximately 88%. The model was converted into ONNX format and integrated into a Unity-based virtual reality (VR) environment, thus creating an interactive analysis platform where users can observe the predictions of both artificial intelligence and rule-based systems at the block level comparatively. In this environment, users can also examine the reasoning of the predictions. The proposed system provides a holistic solution in terms of classification performance, interpretability and user experience, and makes innovative contributions to the analysis and visualization of video coding processes for educational purposes.
HEVC Intra prediction Convolutional neural network Rule-based expert system VR visulation Video coding
Ethics committee approval was not required for this study because there was no study on animals or humans.
This paper proposes an artificial intelligence-based solution to classify the three basic intrinsic prediction modes (Planar, DC and Angular) defined in the High Efficiency Video Coding (HEVC) standard. A convolutional neural network (CNN) based deep learning model trained with 32×32 blocks obtained from 30+ classical gray level test images is developed. As a result of the training, the model demonstrated a successful classification performance with an overall accuracy of over 89% and a macro F1 score of approximately 88%. The model was converted into ONNX format and integrated into a Unity-based virtual reality (VR) environment, thus creating an interactive analysis platform where users can observe the predictions of both artificial intelligence and rule-based systems at the block level comparatively. In this environment, users can also examine the reasoning of the predictions. The proposed system provides a holistic solution in terms of classification performance, interpretability and user experience, and makes innovative contributions to the analysis and visualization of video coding processes for educational purposes.
HEVC Intra prediction Convolutional neural network Rule-based expert system VR visulation Video coding
Ethics committee approval was not required for this study because there was no study on animals or humans.
| Birincil Dil | İngilizce |
|---|---|
| Konular | Sinyal İşleme |
| Bölüm | Research Articles |
| Yazarlar | |
| Erken Görünüm Tarihi | 12 Kasım 2025 |
| Yayımlanma Tarihi | 15 Kasım 2025 |
| Gönderilme Tarihi | 26 Mayıs 2025 |
| Kabul Tarihi | 28 Ekim 2025 |
| Yayımlandığı Sayı | Yıl 2025 Cilt: 8 Sayı: 6 |