Research Article

Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach

Number: 40 September 30, 2022
EN

Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach

Abstract

Multivariate Adaptive Regression Splines (MARS) is a supervised learning model in machine learning, not obtained by an ensemble learning method. Ensemble learning methods are gathered from samples comprising hundreds or thousands of learners that serve the common purpose of improving the stability and accuracy of machine learning algorithms. This study presented REMARS (Random Ensemble MARS), a new MARS model selection approach obtained using the Random Forest (RF) algorithm. 200 training and test data set generated via the Bagging method were analysed in the MARS analysis engine. At the end of the analysis, two different MARS model sets were created, one yielding the smallest Mean Square Error for the test data (Test MSE) and the other yielding the smallest Generalised Cross-Validation (GCV) value. The best model was estimated for both Test MSE and GCV criteria by examining the error of measurement criteria, variable importance averages, and frequencies of the knot values for each model. Eventually, a new model was obtained via the ensemble learning method, i.e., REMARS, that yields result as good as the MARS model obtained from the original data set. The MARS model, which works better in the larger data set, provides more reliable results with smaller data sets utilising the proposed method.

Keywords

References

  1. S. Theodoridis, Machine Learning a Bayesian and Optimisation Perspective, Academic Press of Elsevier, 125 London Wall, London, 2015.
  2. S. Suthaharan, Machine Learning Models and Algorithms for Big Data Classification, Springer International Publishing, New York, 2016.
  3. T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning: Data Mining, Inference and Prediction, Springer Series in Statistics, Stanford, California, 2001.
  4. T. K. Ho, Random Decision Forests, Proceedings of 3rd International Conference on Document Analysis and Recognition (IEEE), Montreal, Canada, 1995, pp. 278–282.
  5. T. K. Ho, The Random Subspace Method for Constructing Decision Forests, IEEE Transactions on Pattern Analysis and Machine Intelligence 20 (2) (1998) 832–844.
  6. T. Hill, P. Lewicki, Statistics: Methods and Applications, StatSoft, Tulsa OK, 2006.
  7. J. R. Leathwick, J. Elith, T. Hastie, Comparative Performance of Generalised Additive Models and Multivariate Adaptive Regression Splines for Statistical Modelling of Species Distributions, Ecological Modelling 199 (2) (2006) 188–196.
  8. D. Yao, J. Yang, X. Zhan, A Novel Method for Disease Prediction: Hybrid of Random Forest and Multivariate Adaptive Regression Splines, Journal of Computers 8 (1) (2013) 170–177.

Details

Primary Language

English

Subjects

Applied Mathematics

Journal Section

Research Article

Publication Date

September 30, 2022

Submission Date

July 22, 2022

Acceptance Date

September 27, 2022

Published in Issue

Year 2022 Number: 40

APA
Sabancı, D., & Cengiz, M. A. (2022). Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach. Journal of New Theory, 40, 27-45. https://doi.org/10.53570/jnt.1147323
AMA
1.Sabancı D, Cengiz MA. Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach. JNT. 2022;(40):27-45. doi:10.53570/jnt.1147323
Chicago
Sabancı, Dilek, and Mehmet Ali Cengiz. 2022. “Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach”. Journal of New Theory, nos. 40: 27-45. https://doi.org/10.53570/jnt.1147323.
EndNote
Sabancı D, Cengiz MA (September 1, 2022) Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach. Journal of New Theory 40 27–45.
IEEE
[1]D. Sabancı and M. A. Cengiz, “Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach”, JNT, no. 40, pp. 27–45, Sept. 2022, doi: 10.53570/jnt.1147323.
ISNAD
Sabancı, Dilek - Cengiz, Mehmet Ali. “Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach”. Journal of New Theory. 40 (September 1, 2022): 27-45. https://doi.org/10.53570/jnt.1147323.
JAMA
1.Sabancı D, Cengiz MA. Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach. JNT. 2022;:27–45.
MLA
Sabancı, Dilek, and Mehmet Ali Cengiz. “Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach”. Journal of New Theory, no. 40, Sept. 2022, pp. 27-45, doi:10.53570/jnt.1147323.
Vancouver
1.Dilek Sabancı, Mehmet Ali Cengiz. Random Ensemble MARS: Model Selection in Multivariate Adaptive Regression Splines Using Random Forest Approach. JNT. 2022 Sep. 1;(40):27-45. doi:10.53570/jnt.1147323

Cited By

 

TR Dizin 26024
 
Electronic Journals Library 13651
 
                                EBSCO 36309                                     DOAJ 33468
Scilit 20865                                                         SOBİAD 30256

 

29324 JNT is licensed under a Creative Commons Attribution-NonCommercial 4.0 International Licence (CC BY-NC)
 

The Journal of New Theory's website content and procedures are publicly accessible under the CC BY-NC license; commercial use requires our permission.