Research Article

A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification

Volume: 22 Number: 2 June 30, 2026
EN

A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification

Abstract

This study investigates whether compact and physically interpretable time-domain descriptors can support accurate low-concentration gas classification without relying on full multichannel waveforms. Using the Gas Sensor Array Low-Concentration dataset, each sample was transformed from raw sensor signals into 120 descriptors that summarize baseline behavior, variability, transient dynamics, and response magnitude. Four classical learning pipelines were evaluated through repeated stratified 5-fold cross-validation, supported by principal component analysis, descriptor ranking, sensor correlation analysis, and sensor subset experiments. The best-performing model, a radial support vector machine trained on the compact descriptor set, achieved a mean accuracy of 0.9456 and a macro-averaged F1 score of 0.9437 across 100 test folds while using 75 times fewer inputs than the raw waveform representation. An important finding is that classification performance improved systematically with concentration: accuracy was 0.9067 at 50 ppb, increased to 0.9333 at 100 ppb, and reached 0.9967 at 200 ppb. Response mean and baseline mean emerged as the most informative descriptor families, while VOCS-P, 2M012, and MQ-137 were the most discriminative sensors. Overall, the results show that compact and interpretable descriptors provide an efficient, reproducible, and practically useful benchmark for low-concentration gas classification in resource-constrained electronic nose systems.

Keywords

References

  1. [1]. James, D, Scott, SM, Ali, Z, O’Hare, WT. 2005. Chemical sensors for electronic nose systems. Microchimica Acta; 149(1-2): 1-17.
  2. [2]. Röck, F, Barsan, N, Weimar, U. 2008. Electronic nose: current status and future trends. Chemical Reviews; 108(2): 705-725.
  3. [3]. Tan, J, Xu, J. 2020. Applications of electronic nose and electronic tongue in food quality-related properties determination: a review. Artificial Intelligence in Agriculture; 4: 104-115.
  4. [4]. Scott, SM, James, D, Ali, Z. 2006. Data analysis for electronic nose systems. Microchimica Acta; 156(3): 183-207.
  5. [5]. Ye, Z, Liu, Y, Li, Q. 2021. Recent progress in smart electronic nose technologies enabled with machine learning methods. Sensors; 21(22): 7620.
  6. [6]. Zhai, Z, Liu, Y, Li, C, Wang, D, Wu, H. 2024. Electronic noses: from gas-sensitive components and practical applications to data processing. Sensors; 24(15): 4806.
  7. [7]. Chowdhury, MAZ, Oehlschlaeger, MA. 2025. Artificial intelligence in gas sensing: a review. ACS Sensors; 10(3): 1538-1563.
  8. [8]. Yan, J, Guo, X, Duan, S, Jia, P, Wang, L, Peng, C, Zhang, S. 2015. Electronic nose feature extraction methods: a review. Sensors; 15(11): 27804-27831.

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Publication Date

June 30, 2026

Submission Date

April 9, 2026

Acceptance Date

June 10, 2026

Published in Issue

Year 2026 Volume: 22 Number: 2

APA
Canbula, B. (2026). A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification. Celal Bayar University Journal of Science, 22(2), 401-409. https://doi.org/10.18466/cbayarfbe.1926951
AMA
1.Canbula B. A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification. CBUJOS. 2026;22(2):401-409. doi:10.18466/cbayarfbe.1926951
Chicago
Canbula, Bora. 2026. “A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification”. Celal Bayar University Journal of Science 22 (2): 401-9. https://doi.org/10.18466/cbayarfbe.1926951.
EndNote
Canbula B (June 1, 2026) A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification. Celal Bayar University Journal of Science 22 2 401–409.
IEEE
[1]B. Canbula, “A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification”, CBUJOS, vol. 22, no. 2, pp. 401–409, June 2026, doi: 10.18466/cbayarfbe.1926951.
ISNAD
Canbula, Bora. “A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification”. Celal Bayar University Journal of Science 22/2 (June 1, 2026): 401-409. https://doi.org/10.18466/cbayarfbe.1926951.
JAMA
1.Canbula B. A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification. CBUJOS. 2026;22:401–409.
MLA
Canbula, Bora. “A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification”. Celal Bayar University Journal of Science, vol. 22, no. 2, June 2026, pp. 401-9, doi:10.18466/cbayarfbe.1926951.
Vancouver
1.Bora Canbula. A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification. CBUJOS. 2026 Jun. 1;22(2):401-9. doi:10.18466/cbayarfbe.1926951