Cost-Weighted Harmonic Score: A Unified Metric for Cost-Sensitive Classification on High-Stakes Imbalanced Data
Abstract
Machine learning classifiers deployed in high-stakes domains like healthcare and finance face the dual challenges of class imbalance and asymmetric misclassification costs, which are poorly addressed by traditional evaluation metrics. The primary purpose of this study is to address this critical gap by developing and validating the Cost-Weighted Harmonic (CWH) score, a novel, bounded performance metric that unifies precision, recall, and specificity within a normalized harmonic mean, explicitly weighted by a user-defined cost ratio, for high-stakes imbalanced classification. Unlike cost-agnostic metrics (e.g., F1, HMRS) or unbounded cost-aware scores (e.g., C-score), CWH is interpretable, stable, and aligns evaluation with domain-specific risk priorities. It is integrated with threshold optimization and validated across healthcare, cybersecurity, and financial datasets, demonstrating superior stability and up to 69% performance improvement against C-score in life-critical scenarios without excessive false positives. CWH effectively bridges the gap between statistical evaluation and operational decision-making, offering practitioners a reliable tool for model selection that aligns with domain-specific risk priorities.
Keywords
References
- A. Dwarampudi and M. K. Yogi, “A robust machine learning model for cyber incident classification and prioritization,” J. Trends Comput. Sci. Smart Technol., vol. 6, no. 1, pp. 51–66, Mar. 2024. doi: 10.36548/jtcsst.2024.1.004
- S. Mohsen, “Alzheimer’s disease detection using deep learning and machine learning: A review,” Artif. Intell. Rev., vol. 58, Art. no. 250, Sep. 2025. doi: 10.1007/s10462-025-11258-y
- R. O. Ogundokun, S. Misra, O. E. Ogundokun, J. B. Oluranti, and R. Maskeliunas, “Machine learning classification-based techniques for fraud discovery in credit card datasets,” in Applied Informatics, Cham, Switzerland: Springer, 2021, pp. 26–38. doi: 10.1007/978-3-030-89654-6_3
- M. Marwah, A. Narayanan, S. Jou, M. Arlitt, and M. Pospelova, “Is 𝐹1 Score Suboptimal for Cybersecurity Models? Introducing 𝐶𝑠𝑐𝑜𝑟𝑒, a Cost-Aware Alternative for Model Assessment,” in CEUR Workshop Proceedings, Arlington, VA, Oct. 2024, pp. 190–209.
- S. A. Hicks et al., “On evaluation metrics for medical applications of artificial intelligence,” Sci. Rep., vol. 12, Art. no. 5979, Apr. 2022. doi: 10.1038/s41598-022-09954-8
- H. Chen, N. Wang, X. Du, K. Mei, Y. Zhou, and G. Cai, “Classification prediction of breast cancer based on machine learning,” Comput. Intell. Neurosci., vol. 2023, Art. no. 6530719, Jan. 2023. doi: 10.1155/2023/6530719
- M. M. Islam, M. R. Haque, H. Iqbal, M. M. Hasan, M. Hasan, and M. N. Kabir, “Breast cancer prediction: A comparative study using machine learning techniques,” SN Comput. Sci., vol. 1, Art. no. 290, Sep. 2020. doi: 10.1007/s42979-020-00305-w
- A. Aslam and A. Hussain, “A performance analysis of machine learning techniques for credit card fraud detection,” J. Artif. Intell., vol. 6, 2024. doi: 10.32604/jai.2024.047226
Details
Primary Language
English
Subjects
Artificial Intelligence (Other)
Journal Section
Research Article
Authors
Philip Odion
0009-0006-2194-1370
Nigeria
Early Pub Date
June 8, 2026
Publication Date
June 17, 2026
Submission Date
September 29, 2025
Acceptance Date
February 8, 2026
Published in Issue
Year 2026 Volume: 9 Number: 2
