TR
EN
Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach
Abstract
This paper focuses on the application of a suite of simulation studies to assess wellknown and contemporary outlier detection methods in linear regression. These simulations vary across different parameters, including the number of observations, parameters, levels, and direction of contamination. The recorded final parameter estimates are used to rank the methods using Multiple-criteria decision-making (MCDM) tools. The study reveals that method success varies based on simulation settings. MCDM analysis results indicate a limited set of applicable methods when the contamination structure and level are unknown. Additionally, the most successful methods demand increased computation time, while some alternatives exhibit applicability within shorter durations with median rankings. These findings offer valuable insights for researchers employing regression analysis in scenarios where the underlying model is known, and the possibility of potential outliers exists.
Keywords
References
- R. Adnan, H. Setan, and M. N. Mohamad. Identifying multiple outliers in linear regression: Robust fit and clustering approach. In The 10th FIG International Symposium on Deformation Measurements, SESSION X : THEORY OF DEFORMATION ANALYSIS II, pages 380-389, Orange, California, USA, 2000. google scholar
- S. Barratt, G. Angeris, and S. Boyd. Minimizing a sum of clipped convex functions. Optimization Letters, 14:2443-2459, 2020. google scholar
- D. A. Belsley, E. Kuh, and R. E. Welsch. Regression diagnostics: Identifying influential data and sources of collinearity. 1980. ISBN 0-471-05856-4. google scholar
- J. Bezanson, A. Edelman, S. Karpinski, and V. B. Shah. Julia: A fresh approach to numerical computing. SIAM review, 59(1):65-98, 2017. doi:10.1137/141000671. google scholar
- N. Billor and G. Kiral. A comparison of multiple outlier detection methods for regression data. Communications in Statistics—Simulation and Computation®, 37(3):521-545, 2008. google scholar
- N. Billor, A. S. Hadi, and P. F. Velleman. Bacon: blocked adaptive computationally efficient outlier nominators. Computational statistics & data analysis, 34(3):279-298, 2000. google scholar
- N. Billor, S. Chatterjee, and A. S. Hadi. A re-weighted least squares method for robust regression estimation. American journal of mathematical and management sciences, 26(3-4):229-252, 2006. google scholar
- S. Chatterjee and M. Machler. Robust regression: A weighted least squares approach. Communications in Statistics-Theory andMethods, 26(6): 1381-1394, 1997. google scholar
Details
Primary Language
English
Subjects
Data Mining and Knowledge Discovery, Statistical Data Science
Journal Section
Research Article
Authors
Publication Date
December 29, 2023
Submission Date
July 14, 2023
Acceptance Date
November 10, 2023
Published in Issue
Year 2023 Volume: 7 Number: 2
APA
Satman, M. H. (2023). Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach. Acta Infologica, 7(2), 333-347. https://doi.org/10.26650/acin.1327370
AMA
1.Satman MH. Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach. ACIN. 2023;7(2):333-347. doi:10.26650/acin.1327370
Chicago
Satman, Mehmet Hakan. 2023. “Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach”. Acta Infologica 7 (2): 333-47. https://doi.org/10.26650/acin.1327370.
EndNote
Satman MH (December 1, 2023) Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach. Acta Infologica 7 2 333–347.
IEEE
[1]M. H. Satman, “Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach”, ACIN, vol. 7, no. 2, pp. 333–347, Dec. 2023, doi: 10.26650/acin.1327370.
ISNAD
Satman, Mehmet Hakan. “Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach”. Acta Infologica 7/2 (December 1, 2023): 333-347. https://doi.org/10.26650/acin.1327370.
JAMA
1.Satman MH. Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach. ACIN. 2023;7:333–347.
MLA
Satman, Mehmet Hakan. “Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach”. Acta Infologica, vol. 7, no. 2, Dec. 2023, pp. 333-47, doi:10.26650/acin.1327370.
Vancouver
1.Mehmet Hakan Satman. Comparison of Outlier Detection Methods in Linear Regression: A Multiple-Criteria Decision-Making Approach. ACIN. 2023 Dec. 1;7(2):333-47. doi:10.26650/acin.1327370