Research Article

Which pooling method is better: Max, Avg, or Concat (Max, Avg)

Volume: 66 Number: 1 June 14, 2024
EN

Which pooling method is better: Max, Avg, or Concat (Max, Avg)

Abstract

Pooling is a non-linear operation that aggregates the results of a given region to a single value. This method effectively removes extraneous details in feature maps while keeping the overall information. As a result, the size of feature maps is reduced, which decreases computing costs and prevents overfitting by eliminating irrelevant data. In CNN models, the max pooling and average pooling methods are commonly utilized. The max pooling selects the highest value within the pooling area and aids in preserving essential features of the image. However, it ignores the other values inside the pooling region, resulting in a significant loss of information. The average pooling computes the average values within the pooling area, which reduces data loss. However, by failing to emphasize critical pixels in the image, it may result in the loss of significant features. To examine the performance of pooling methods, this study comprised the experimental analysis of multiple models, i.e. shallow and deep, datasets, i.e. Cifar10, Cifar100, and SVHN, and pool sizes, e.g. $2x2$, $3x3$, $10x10$. Furthermore, the study investigated the effectiveness of combining two approaches, namely Concat (Max, Avg), to minimize information loss. The findings of this work provide an important guideline for selecting pooling methods in the design of CNNs. The experimental results demonstrate that pooling methods have a considerable impact on model performance. Moreover, there are variances based on the model and pool size.

Keywords

References

  1. Atas, I., Human gender prediction based on deep transfer learning from panoramic dental radiograph images, Trait. du Signal, 39 (5) (2022), 1585, http://dx.doi.org/10.18280/ts.390515.
  2. Atas, M., Ozdemir, C., Atas, I., Ak, B., Ozeroglu, E, Biometric identification using panoramic dental radiographic images withfew-shot learning, Turk. J. Electr. Eng., 30 (3) (2022), 1115- 1126, http://dx.doi.org/10.55730/1300-0632.3830.
  3. Ozdemir, C., Gedik, M. A., Kaya, Y., Age estimation from left-hand radiographs with deep learning methods, Trait. du Signal, 38 (6) (2021), http://dx.doi.org/10.18280/ts.380601.
  4. Krizhevsky, A., Sutskever, I., Hinton, G. E., Imagenet classification with deep convolutional neural networks, Commun. ACM, 60 (6) (2017), 84-90, http://dx.doi.org/10.1145/3065386.
  5. Tolstikhin, I. O., Houlsby, N., Kolesnikov, A., Beyer, L., Zhai, X., Unterthiner, T., Dosovitskiy, A., Mlp-mixer: An all-mlp architecture for vision, Adv. Neural Inf. Process. Syst., 34 (2021), 24261-24272, https://arxiv.org/abs/2105.01601.
  6. Meng, L., Li, H., Chen, B. C., Lan, S., Wu, Z., Jiang, Y. G., Lim, S. N., Adavit: Adaptive vision transformers for efficient image recognition, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, (2022), 12309-12318, http://dx.doi.org/10.1109/cvpr52688.2022.01199.
  7. Krizhevsky, A., Nair, V., rey Hinton, G., CIFAR-10 dataset, (2014), Available at: https://www.cs.toronto.edu/ kriz/cifar.html.
  8. Netzer, Y., Wang, T., Coates, A., Bissacco, A., Wu, B., Ng, A., The street view house numbers (SVHN) dataset, (2016). Available at: https://www.kaggle.com/datasets/stanfordu/streetview-house-numbers.

Details

Primary Language

English

Subjects

Information Systems (Other)

Journal Section

Research Article

Early Pub Date

April 7, 2024

Publication Date

June 14, 2024

Submission Date

September 6, 2023

Acceptance Date

November 17, 2023

Published in Issue

Year 2024 Volume: 66 Number: 1

APA
Doğan, Y. (2024). Which pooling method is better: Max, Avg, or Concat (Max, Avg). Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, 66(1), 95-117. https://doi.org/10.33769/aupse.1356138
AMA
1.Doğan Y. Which pooling method is better: Max, Avg, or Concat (Max, Avg). Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2024;66(1):95-117. doi:10.33769/aupse.1356138
Chicago
Doğan, Yahya. 2024. “Which Pooling Method Is Better: Max, Avg, or Concat (Max, Avg)”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 66 (1): 95-117. https://doi.org/10.33769/aupse.1356138.
EndNote
Doğan Y (June 1, 2024) Which pooling method is better: Max, Avg, or Concat (Max, Avg). Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 66 1 95–117.
IEEE
[1]Y. Doğan, “Which pooling method is better: Max, Avg, or Concat (Max, Avg)”, Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng., vol. 66, no. 1, pp. 95–117, June 2024, doi: 10.33769/aupse.1356138.
ISNAD
Doğan, Yahya. “Which Pooling Method Is Better: Max, Avg, or Concat (Max, Avg)”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering 66/1 (June 1, 2024): 95-117. https://doi.org/10.33769/aupse.1356138.
JAMA
1.Doğan Y. Which pooling method is better: Max, Avg, or Concat (Max, Avg). Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2024;66:95–117.
MLA
Doğan, Yahya. “Which Pooling Method Is Better: Max, Avg, or Concat (Max, Avg)”. Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering, vol. 66, no. 1, June 2024, pp. 95-117, doi:10.33769/aupse.1356138.
Vancouver
1.Yahya Doğan. Which pooling method is better: Max, Avg, or Concat (Max, Avg). Commun.Fac.Sci.Univ.Ank.Series A2-A3: Phys.Sci. and Eng. 2024 Jun. 1;66(1):95-117. doi:10.33769/aupse.1356138

Cited By

Communications Faculty of Sciences University of Ankara Series A2-A3 Physical Sciences and Engineering licensed under a Creative Commons Attribution 4.0 International License.

Creative Commons License