Research Article

Prioritization of Regression Test Cases Based on Machine Learning Methods

Volume: 38 Number: 1 March 1, 2025
EN

Prioritization of Regression Test Cases Based on Machine Learning Methods

Abstract

Due to resource and time constraints involved in the software testing process, it is not possible to implement all test scenarios for each release. Test scenarios can be prioritized according to certain criteria defined by the developers to ensure effective execution of the testing process and detection of errors. This study investigated whether machine learning based models could be used to prioritize test scenarios created in regression testing. It is attempted to determine which tests can be prioritized for execution based on different independent variables. In total, each of the 964 test scenarios in the dataset was labelled as minor (482) and major (482) by two experts. In the models, the number of related requirements, the number of related errors, and the age of the scenario were used as independent variables, and the scenario classes labelled as minor - major were taken as the target variable. The scenarios were pre-processed using natural language processing techniques and different machine learning algorithms were used for model development. In the classification based on test scenarios, the random forest algorithm showed the best performance with a F1-score of 81%. In the classification based on the number of related requirements, the number of interrelated errors, and the age of the test scenarios, the random forest model once again demonstrated the highest success rate at 79%. This study demonstrates that machine learning techniques offer a variety of models for test case prioritization.

Keywords

References

  1. [1] Atifi, M., Mamouni, A., and Marzak, A., “A comparative study of software testing techniques”, Lecture Notes in Computer Science, 10299: 373–390, (2017). DOI: 10.1007/978-3-319-59647-1_27
  2. [2] Lonetti, F., and Marchetti, E., “Emerging Software Testing Technologies”, Advances in Computers, 108: 91–143, (2018).
  3. [3] Shah, U.S., Jinwala, D.C., and Patel, S.J., “An Excursion to Software Development Life Cycle Models”, ACM SIGSOFT Software Engineering Notes, 41(1): 1–6, (2016). DOI: 10.1145/2853073.2853080
  4. [4] Khatibsyarbini, M., Isa, M.A., Jawawi, D.N.A., and Tumeng, R., “Trend Application of Machine Learning in Test Case Prioritization: A Review on Techniques”, IEEE Access, 9: 166262–166282, (2021). DOI: 10.1109/ACCESS.2021.3135508
  5. [5] Khatibsyarbini, M., Isa, M.A., Jawawi, D.N.A., and Tumeng, R., “Test case prioritization approaches in regression testing: A systematic literature review”, Information and Software Technology, 93: 74–93, (2018). DOI: 10.1016/j.infsof.2017.08.014
  6. [6] Harold, M.J., “Testing: A roadmap”, Proceedings of the Conference on the Future of Software Engineering, ICSE 2000, 61–72, (2000). DOI: 10.1145/336512.336532
  7. [7] Lou, Y., Chen, J., Zhang, L., and Hao, D., “A Survey on Regression Test-Case Prioritization”, Advances in Computers, 113: 1–46, (2019).
  8. [8] Minhas, N.M., Petersen, K., Börstler, J., and Wnuk, K., “Regression testing for large-scale embedded software development – Exploring the state of practice”, Information and Software Technology, 120: 106254, (2020). DOI: 10.1016/j.infsof.2019.106254

Details

Primary Language

English

Subjects

Software Testing, Verification and Validation

Journal Section

Research Article

Early Pub Date

December 29, 2024

Publication Date

March 1, 2025

Submission Date

March 5, 2024

Acceptance Date

November 23, 2024

Published in Issue

Year 2025 Volume: 38 Number: 1

APA
Kıran, S., Emre, İ. E., & Taşdelen, S. (2025). Prioritization of Regression Test Cases Based on Machine Learning Methods. Gazi University Journal of Science, 38(1), 131-144. https://doi.org/10.35378/gujs.1446469
AMA
1.Kıran S, Emre İE, Taşdelen S. Prioritization of Regression Test Cases Based on Machine Learning Methods. Gazi University Journal of Science. 2025;38(1):131-144. doi:10.35378/gujs.1446469
Chicago
Kıran, Selçuk, İlkim Ecem Emre, and Selen Taşdelen. 2025. “Prioritization of Regression Test Cases Based on Machine Learning Methods”. Gazi University Journal of Science 38 (1): 131-44. https://doi.org/10.35378/gujs.1446469.
EndNote
Kıran S, Emre İE, Taşdelen S (March 1, 2025) Prioritization of Regression Test Cases Based on Machine Learning Methods. Gazi University Journal of Science 38 1 131–144.
IEEE
[1]S. Kıran, İ. E. Emre, and S. Taşdelen, “Prioritization of Regression Test Cases Based on Machine Learning Methods”, Gazi University Journal of Science, vol. 38, no. 1, pp. 131–144, Mar. 2025, doi: 10.35378/gujs.1446469.
ISNAD
Kıran, Selçuk - Emre, İlkim Ecem - Taşdelen, Selen. “Prioritization of Regression Test Cases Based on Machine Learning Methods”. Gazi University Journal of Science 38/1 (March 1, 2025): 131-144. https://doi.org/10.35378/gujs.1446469.
JAMA
1.Kıran S, Emre İE, Taşdelen S. Prioritization of Regression Test Cases Based on Machine Learning Methods. Gazi University Journal of Science. 2025;38:131–144.
MLA
Kıran, Selçuk, et al. “Prioritization of Regression Test Cases Based on Machine Learning Methods”. Gazi University Journal of Science, vol. 38, no. 1, Mar. 2025, pp. 131-44, doi:10.35378/gujs.1446469.
Vancouver
1.Selçuk Kıran, İlkim Ecem Emre, Selen Taşdelen. Prioritization of Regression Test Cases Based on Machine Learning Methods. Gazi University Journal of Science. 2025 Mar. 1;38(1):131-44. doi:10.35378/gujs.1446469