EN
Children of the Tree: Optimised Rule Extraction from Machine Learning Models
Abstract
The “Children of the Tree” algorithm provides a strong understanding of how the imbalanced dataset is classified by extracting rules from each tree of the Random Forest (RF) model. Basically, it converts the divisions created at each node of the trees into “if-then” rules and extracts individual rules for each tree by differentiating the general “community model” perception in the RF. Thus, the algorithm finds the “Children of the Tree” by converting the forest into a rule set. This study, developed on the “German Credit Data Set”, which is one of the banking data sets on which many studies have been conducted in the literature; determines the rules that cause to fall into that class(class good or class bad) for candidate customers. In this way, the bank would see the rules for potential customers belonging to the risky class and have the chance to recommend the alternative plans/products that are suitable for their risk strategy to their potential customers. The study evaluates rule validity and reliability using association rule mining metrics—support, confidence, lift, leverage, conviction - calculates "Minimum Description Length" (MDL), and ranks rules by "support" and "MDL cost" to extract the simplest rules for each class. It addresses risk management in banking and marketing needs, using MDL cost and SMOTE to handle imbalanced datasets, setting it apart from other algorithms.
Keywords
References
- Leo Breiman. 2001. Random Forests. Machine Learning. 45, 1 (Oct. 2001), 5-32.https://doi.org/10.1023/A:1010933404324. google scholar
- Houtao Deng. 2019. Interpreting Tree Ensembles with inTrees. International Journal of Data Science and AnalYtics. 7, 4 (Dec. 2019), 277-287. https://doi.org/l0.1007/s41060-018-0144-8. google scholar
- Kim Phung Lu Thi, Ngoc Chau Vo Thi, and NguYen Hua Phung. 2015. Extracting Rule RF in Educational Data Classification: From a Random Forest to Interpretable Refined Rules. In Proceedings of the 2015 International Conference on Advanced Computing and Applications (ACOMP). 20-27. https://doi.org/10.1109/ACOMP.2015.13. google scholar
- element Benard, Gerard Biau, Sebastien da Veiga, and Erwan Scornet. 2019. SIRUS: Stable and Interpretable Rule Set for Classi-fication. arXiv:1908.06852. Retrieved from https://arxiv.org/abs/1908.06852. google scholar
- Morteza Mashayekhi and Robin Gras. 2015. Rule Extraction from Random Forest: The C Methods. In Kanade, T., Kittler, J., Kleinberg, J.M., et al. (Eds.). Advances in Artificial Intelligence. Vol. 3060. Springer Berlin Heidelberg, Berlin, Heidelberg, 223237. 10.1007/978-3-319-18356-5_20 google scholar
- Satoshi Hara and Kohei HaYashi. 2017. Making Tree Ensembles Interpretable: A BaYesian Model Selection Approach.arXiv:1606.09066. Retrieved from https://arxiv.org/abs/1606.09066. google scholar
- Md Nasim Adnan and Md Zahidul Islam. 2017. For A New Framework for Knowledge DiscoverY from Decision Forests. Australasian Journal of Information SYstems. 21, (Nov. 2017). https://doi.org/10.3127/ajis.v21i0.1539. google scholar
- S. Ilker Birbil, Mert Edali, and Birol Yuceoglu. 2020. Rule Covering for Interpretation and Boosting. arXiv:2007.06379.Retrieved from https://arxiv.org/abs/2007.06379. google scholar
Details
Primary Language
English
Subjects
Artificial Intelligence (Other)
Journal Section
Research Article
Publication Date
January 30, 2025
Submission Date
December 24, 2024
Acceptance Date
January 20, 2025
Published in Issue
Year 2025 Volume: 1 Number: 1
APA
Meydan, H., & Bal, M. (2025). Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications, 1(1), 14-35. https://izlik.org/JA78ZL99GP
AMA
1.Meydan H, Bal M. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1(1):14-35. https://izlik.org/JA78ZL99GP
Chicago
Meydan, Hilal, and Mert Bal. 2025. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications 1 (1): 14-35. https://izlik.org/JA78ZL99GP.
EndNote
Meydan H, Bal M (January 1, 2025) Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications 1 1 14–35.
IEEE
[1]H. Meydan and M. Bal, “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”, Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, pp. 14–35, Jan. 2025, [Online]. Available: https://izlik.org/JA78ZL99GP
ISNAD
Meydan, Hilal - Bal, Mert. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications 1/1 (January 1, 2025): 14-35. https://izlik.org/JA78ZL99GP.
JAMA
1.Meydan H, Bal M. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications. 2025;1:14–35.
MLA
Meydan, Hilal, and Mert Bal. “Children of the Tree: Optimised Rule Extraction from Machine Learning Models”. Journal of Data Analytics and Artificial Intelligence Applications, vol. 1, no. 1, Jan. 2025, pp. 14-35, https://izlik.org/JA78ZL99GP.
Vancouver
1.Hilal Meydan, Mert Bal. Children of the Tree: Optimised Rule Extraction from Machine Learning Models. Journal of Data Analytics and Artificial Intelligence Applications [Internet]. 2025 Jan. 1;1(1):14-35. Available from: https://izlik.org/JA78ZL99GP