Research Article

Children of the Tree: Optimised Rule Extraction from Machine Learning Models

Volume: 1 Number: 1 January 30, 2025
EN

Children of the Tree: Optimised Rule Extraction from Machine Learning Models

Abstract

The “Children of the Tree” algorithm provides a strong understanding of how the imbalanced dataset is classified by extracting rules from each tree of the Random Forest (RF) model. Basically, it converts the divisions created at each node of the trees into “if-then” rules and extracts individual rules for each tree by differentiating the general “community model” perception in the RF. Thus, the algorithm finds the “Children of the Tree” by converting the forest into a rule set. This study, developed on the “German Credit Data Set”, which is one of the banking data sets on which many studies have been conducted in the literature; determines the rules that cause to fall into that class(class good or class bad) for candidate customers. In this way, the bank would see the rules for potential customers belonging to the risky class and have the chance to recommend the alternative plans/products that are suitable for their risk strategy to their potential customers. The study evaluates rule validity and reliability using association rule mining metrics—support, confidence, lift, leverage, conviction - calculates "Minimum Description Length" (MDL), and ranks rules by "support" and "MDL cost" to extract the simplest rules for each class. It addresses risk management in banking and marketing needs, using MDL cost and SMOTE to handle imbalanced datasets, setting it apart from other algorithms.

Keywords

References

  1. Leo Breiman. 2001. Random Forests. Machine Learning. 45, 1 (Oct. 2001), 5-32.https://doi.org/10.1023/A:1010933404324. google scholar
  2. Houtao Deng. 2019. Interpreting Tree Ensembles with inTrees. International Journal of Data Science and AnalYtics. 7, 4 (Dec. 2019), 277-287. https://doi.org/l0.1007/s41060-018-0144-8. google scholar
  3. Kim Phung Lu Thi, Ngoc Chau Vo Thi, and NguYen Hua Phung. 2015. Extracting Rule RF in Educational Data Classification: From a Random Forest to Interpretable Refined Rules. In Proceedings of the 2015 International Conference on Advanced Computing and Applications (ACOMP). 20-27. https://doi.org/10.1109/ACOMP.2015.13. google scholar
  4. element Benard, Gerard Biau, Sebastien da Veiga, and Erwan Scornet. 2019. SIRUS: Stable and Interpretable Rule Set for Classi-fication. arXiv:1908.06852. Retrieved from https://arxiv.org/abs/1908.06852. google scholar
  5. Morteza Mashayekhi and Robin Gras. 2015. Rule Extraction from Random Forest: The C Methods. In Kanade, T., Kittler, J., Kleinberg, J.M., et al. (Eds.). Advances in Artificial Intelligence. Vol. 3060. Springer Berlin Heidelberg, Berlin, Heidelberg, 223237. 10.1007/978-3-319-18356-5_20 google scholar
  6. Satoshi Hara and Kohei HaYashi. 2017. Making Tree Ensembles Interpretable: A BaYesian Model Selection Approach.arXiv:1606.09066. Retrieved from https://arxiv.org/abs/1606.09066. google scholar
  7. Md Nasim Adnan and Md Zahidul Islam. 2017. For A New Framework for Knowledge DiscoverY from Decision Forests. Australasian Journal of Information SYstems. 21, (Nov. 2017). https://doi.org/10.3127/ajis.v21i0.1539. google scholar
  8. S. Ilker Birbil, Mert Edali, and Birol Yuceoglu. 2020. Rule Covering for Interpretation and Boosting. arXiv:2007.06379.Retrieved from https://arxiv.org/abs/2007.06379. google scholar

Details

Primary Language

English

Subjects

Artificial Intelligence (Other)

Journal Section

Research Article

Publication Date

January 30, 2025

Submission Date

December 24, 2024

Acceptance Date

January 20, 2025

Published in Issue

Year 2025 Volume: 1 Number: 1

APA
Meydan, H., & Bal, M. (2025). Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications, 1(1), 14-35. https://izlik.org/JA78ZL99GP
AMA
1.Meydan H, Bal M. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1(1):14-35. https://izlik.org/JA78ZL99GP
Chicago
Meydan, Hilal, and Mert Bal. 2025. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications 1 (1): 14-35. https://izlik.org/JA78ZL99GP.
EndNote
Meydan H, Bal M (January 1, 2025) Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications 1 1 14–35.
IEEE
[1]H. Meydan and M. Bal, “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”, Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, pp. 14–35, Jan. 2025, [Online]. Available: https://izlik.org/JA78ZL99GP
ISNAD
Meydan, Hilal - Bal, Mert. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications 1/1 (January 1, 2025): 14-35. https://izlik.org/JA78ZL99GP.
JAMA
1.Meydan H, Bal M. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1:14–35.
MLA
Meydan, Hilal, and Mert Bal. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, Jan. 2025, pp. 14-35, https://izlik.org/JA78ZL99GP.
Vancouver
1.Hilal Meydan, Mert Bal. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications [Internet]. 2025 Jan. 1;1(1):14-35. Available from: https://izlik.org/JA78ZL99GP