EN
Evaluation of Convolutional Networks for Event Camera Face Pose Alignment
Abstract
Event camera offers substantial advantages over conventional video cameras with their efficiency, extremely high temporal resolutions, low latency, and high dynamic range. These benefits have led to applications in various vision domains. Recently they have been applied in facial recognition tasks as well. However, while significant advantages of event cameras in some facial processing tasks have been demonstrated, the initial stage in almost any task, i.e., face alignment, is not at par with the conventional cameras. This study investigates the use of face alignment convolutional networks regarding both performance and complexity for event camera processing. Our aim is event camera face pose alignment that can be used as an efficient preprocessor for facial tasks. Therefore, we comparatively evaluate simple convolutional coordinate regression with a hybrid of coordinate and heatmap regression, known as pixel-in-pixel regression. Our experimental results reveal the superior performance of the hybrid method. However, we also show that if there is a computation bottleneck, simple convolutional coordinate regression is preferable for their low resource requirements though at the expense of some performance loss.
Keywords
Supporting Institution
YAŞAR ÜNİVERSİTESİ
Project Number
BAP112
Thanks
This work was supported by the Yaşar University Project Evaluation Commission, Turkey for the project ‘‘Dynamic Facial Analysis with Neuromorphic Camera’’ [grant number: BAP112].
References
- G. Gallego et al., “Event-Based Vision: A Survey,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 44, no. 1, pp. 154–180, Jan. 2022, doi: 10.1109/TPAMI.2020.3008413.
- G. Tan, Y. Wang, H. Han, Y. Cao, F. Wu, and Z.-J. Zha, “Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading,” in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA: IEEE, Jun. 2022, pp. 20062–20071. doi: 10.1109/CVPR52688.2022.01946.
- G. Moreira, A. Graca, B. Silva, P. Martins, and J. Batista, “Neuromorphic Event-based Face Identity Recognition,” in 2022 26th International Conference on Pattern Recognition (ICPR), Montreal, QC, Canada: IEEE, Aug. 2022, pp. 922–929. doi: 10.1109/ICPR56361.2022.9956236.
- A. Savran, “Fully Convolutional Event-camera Voice Activity Detection Based on Event Intensity,” in 2023 Innovations in Intelligent Systems and Applications Conference (ASYU), Sivas, Turkiye: IEEE, Oct. 2023, pp. 1–6. doi: 10.1109/ASYU58738.2023.10296754.
- A. Savran, “Multi-timescale boosting for efficient and improved event camera face pose alignment,” Computer Vision and Image Understanding, vol. 236, p. 103817, Nov. 2023, doi: 10.1016/j.cviu.2023.103817.
- A. Savran and C. Bartolozzi, “Face Pose Alignment with Event Cameras,” Sensors, vol. 20, no. 24, p. 7079, Dec. 2020, doi: 10.3390/s20247079.
- Z.-H. Feng, J. Kittler, M. Awais, and X.-J. Wu, “Rectified Wing Loss for Efficient and Robust Facial Landmark Localisation with Convolutional Neural Networks,” Int J Comput Vis, vol. 128, no. 8–9, pp. 2126–2145, Sep. 2020, doi: 10.1007/s11263-019-01275-0.
- H. Jin, S. Liao, and L. Shao, “Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild,” Int J Comput Vis, vol. 129, no. 12, pp. 3174–3194, Dec. 2021, doi: 10.1007/s11263-021-01521-4.
Details
Primary Language
English
Subjects
Deep Learning, Machine Vision
Journal Section
Research Article
Early Pub Date
May 30, 2025
Publication Date
May 31, 2025
Submission Date
January 9, 2024
Acceptance Date
January 16, 2025
Published in Issue
Year 2025 Volume: 13 Number: 2
APA
Oral, B. B., Çakıcı, A., & Savran, A. (2025). Evaluation of Convolutional Networks for Event Camera Face Pose Alignment. Academic Platform Journal of Engineering and Smart Systems, 13(2), 22-30. https://doi.org/10.21541/apjess.1417068
AMA
1.Oral BB, Çakıcı A, Savran A. Evaluation of Convolutional Networks for Event Camera Face Pose Alignment. APJESS. 2025;13(2):22-30. doi:10.21541/apjess.1417068
Chicago
Oral, Burhan Burak, Alptuğ Çakıcı, and Arman Savran. 2025. “Evaluation of Convolutional Networks for Event Camera Face Pose Alignment”. Academic Platform Journal of Engineering and Smart Systems 13 (2): 22-30. https://doi.org/10.21541/apjess.1417068.
EndNote
Oral BB, Çakıcı A, Savran A (May 1, 2025) Evaluation of Convolutional Networks for Event Camera Face Pose Alignment. Academic Platform Journal of Engineering and Smart Systems 13 2 22–30.
IEEE
[1]B. B. Oral, A. Çakıcı, and A. Savran, “Evaluation of Convolutional Networks for Event Camera Face Pose Alignment”, APJESS, vol. 13, no. 2, pp. 22–30, May 2025, doi: 10.21541/apjess.1417068.
ISNAD
Oral, Burhan Burak - Çakıcı, Alptuğ - Savran, Arman. “Evaluation of Convolutional Networks for Event Camera Face Pose Alignment”. Academic Platform Journal of Engineering and Smart Systems 13/2 (May 1, 2025): 22-30. https://doi.org/10.21541/apjess.1417068.
JAMA
1.Oral BB, Çakıcı A, Savran A. Evaluation of Convolutional Networks for Event Camera Face Pose Alignment. APJESS. 2025;13:22–30.
MLA
Oral, Burhan Burak, et al. “Evaluation of Convolutional Networks for Event Camera Face Pose Alignment”. Academic Platform Journal of Engineering and Smart Systems, vol. 13, no. 2, May 2025, pp. 22-30, doi:10.21541/apjess.1417068.
Vancouver
1.Burhan Burak Oral, Alptuğ Çakıcı, Arman Savran. Evaluation of Convolutional Networks for Event Camera Face Pose Alignment. APJESS. 2025 May 1;13(2):22-30. doi:10.21541/apjess.1417068