A Compact and Explainable Machine Learning Pipeline for Low-Concentration Gas Sensor Array Classification
Abstract
This study investigates whether compact and physically interpretable time-domain descriptors can support accurate low-concentration gas classification without relying on full multichannel waveforms. Using the Gas Sensor Array Low-Concentration dataset, each sample was transformed from raw sensor signals into 120 descriptors that summarize baseline behavior, variability, transient dynamics, and response magnitude. Four classical learning pipelines were evaluated through repeated stratified 5-fold cross-validation, supported by principal component analysis, descriptor ranking, sensor correlation analysis, and sensor subset experiments. The best-performing model, a radial support vector machine trained on the compact descriptor set, achieved a mean accuracy of 0.9456 and a macro-averaged F1 score of 0.9437 across 100 test folds while using 75 times fewer inputs than the raw waveform representation. An important finding is that classification performance improved systematically with concentration: accuracy was 0.9067 at 50 ppb, increased to 0.9333 at 100 ppb, and reached 0.9967 at 200 ppb. Response mean and baseline mean emerged as the most informative descriptor families, while VOCS-P, 2M012, and MQ-137 were the most discriminative sensors. Overall, the results show that compact and interpretable descriptors provide an efficient, reproducible, and practically useful benchmark for low-concentration gas classification in resource-constrained electronic nose systems.
Keywords
References
- [1]. James, D, Scott, SM, Ali, Z, O’Hare, WT. 2005. Chemical sensors for electronic nose systems. Microchimica Acta; 149(1-2): 1-17.
- [2]. Röck, F, Barsan, N, Weimar, U. 2008. Electronic nose: current status and future trends. Chemical Reviews; 108(2): 705-725.
- [3]. Tan, J, Xu, J. 2020. Applications of electronic nose and electronic tongue in food quality-related properties determination: a review. Artificial Intelligence in Agriculture; 4: 104-115.
- [4]. Scott, SM, James, D, Ali, Z. 2006. Data analysis for electronic nose systems. Microchimica Acta; 156(3): 183-207.
- [5]. Ye, Z, Liu, Y, Li, Q. 2021. Recent progress in smart electronic nose technologies enabled with machine learning methods. Sensors; 21(22): 7620.
- [6]. Zhai, Z, Liu, Y, Li, C, Wang, D, Wu, H. 2024. Electronic noses: from gas-sensitive components and practical applications to data processing. Sensors; 24(15): 4806.
- [7]. Chowdhury, MAZ, Oehlschlaeger, MA. 2025. Artificial intelligence in gas sensing: a review. ACS Sensors; 10(3): 1538-1563.
- [8]. Yan, J, Guo, X, Duan, S, Jia, P, Wang, L, Peng, C, Zhang, S. 2015. Electronic nose feature extraction methods: a review. Sensors; 15(11): 27804-27831.
Details
Primary Language
English
Subjects
Computer Software
Journal Section
Research Article
Authors
Bora Canbula
*
0000-0003-1088-2804
Türkiye
Publication Date
June 30, 2026
Submission Date
April 9, 2026
Acceptance Date
June 10, 2026
Published in Issue
Year 2026 Volume: 22 Number: 2