Research Article

A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models

Volume: 14 Number: 2 December 31, 2022
EN

A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models

Abstract

Recently, collection of huge amount of data and analysis of that much data have vital importance for human activities in many different application areas. Advanced statistical methods play crucial role for modeling of such data when the data contains outliers. Although there are number of outlier detection methods for revealing outlier observations in data, most of them may not be reasonable and appropriate for prediction purposes due to structural and requirements of modeling. In this study, density based clustering algorithm named Density-Based Spatial Clustering of Applications with Noise (DBSCAN) is considered in order to detect the location of outlier observations effectively with respect to form of the model for given data set. Based on obtained results, the Mean Shift Outlier Model (MSOM) is constructed as a robust linear model. This newly proposed computational approach based on DBSCAN uses power of data clustering and also minimize the impact of the outlier observations by MSOM. The numerical examples are also presented to reveal the performance of the proposed approach in this study.

Keywords

References

  1. Bakar, Z.A., Mohemad, R., Ahmad, A. and Deris, M.M. (2006). A comparative study for outlier detection techniques in data mining. In 2006 IEEE Conference on Cybernetics and Intelligent Systems IEEE, 1-6.
  2. Barnett, V. and Lewis, T. (1994). Outliers in Statistical Data. Wiley, Great Britain.
  3. Campulova, M., Michalek, J. and Moucka, J. (2019). Generalised linear model-based algorithm for detection of outliers in environmental data and comparison with semi-parametric outlier detection methods. Atmospheric Pollution Research, 10(4), 1015-1023.
  4. Cook, R.D. (1979). Influential observations in linear regression, Journal of the American Statistical Association, 74, 1691-74.
  5. Cook, R.D. and Weisberg, S. (1982). Residuals and Influence in Regression. Chapman and Hall, New York.
  6. Daneshgar, A., Javadi, R. and Razavi, S.S. (2013). Clustering and outlier detection using isoperimetric number of trees. Pattern Recognition, 46(12), 3371-3382.
  7. Ester, M., Kriegel, H.P., Sander, J. and Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD’96 Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 96, 226–231.
  8. Gan, G., Ma, C. and Wu, J. (2020). Data Clustering: Theory, Algorithms, and Applications. Philadelphia, PA, USA SIAM Press.

Details

Primary Language

English

Subjects

Mathematical Sciences

Journal Section

Research Article

Publication Date

December 31, 2022

Submission Date

March 19, 2022

Acceptance Date

July 27, 2022

Published in Issue

Year 2022 Volume: 14 Number: 2

APA
Yerlikaya Özkurt, F. (2022). A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models. Istatistik Journal of The Turkish Statistical Association, 14(2), 87-96. https://izlik.org/JA99PM46MY
AMA
1.Yerlikaya Özkurt F. A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models. IJTSA. 2022;14(2):87-96. https://izlik.org/JA99PM46MY
Chicago
Yerlikaya Özkurt, Fatma. 2022. “A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models”. Istatistik Journal of The Turkish Statistical Association 14 (2): 87-96. https://izlik.org/JA99PM46MY.
EndNote
Yerlikaya Özkurt F (December 1, 2022) A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models. Istatistik Journal of The Turkish Statistical Association 14 2 87–96.
IEEE
[1]F. Yerlikaya Özkurt, “A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models”, IJTSA, vol. 14, no. 2, pp. 87–96, Dec. 2022, [Online]. Available: https://izlik.org/JA99PM46MY
ISNAD
Yerlikaya Özkurt, Fatma. “A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models”. Istatistik Journal of The Turkish Statistical Association 14/2 (December 1, 2022): 87-96. https://izlik.org/JA99PM46MY.
JAMA
1.Yerlikaya Özkurt F. A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models. IJTSA. 2022;14:87–96.
MLA
Yerlikaya Özkurt, Fatma. “A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models”. Istatistik Journal of The Turkish Statistical Association, vol. 14, no. 2, Dec. 2022, pp. 87-96, https://izlik.org/JA99PM46MY.
Vancouver
1.Fatma Yerlikaya Özkurt. A New Computational Approach Based on Density Clustering for Outlier Problems in Linear Models. IJTSA [Internet]. 2022 Dec. 1;14(2):87-96. Available from: https://izlik.org/JA99PM46MY