Vision Transformer-Based Approach: A Novel Method for Object Recognition
Abstract
Keywords
Object recognition, Vision Transformer, Logistic Regression, Caltech 101, Image Processing, Artificial Intelligence