Research Article
BibTex RIS Cite

Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi

Year 2022, Volume: 15 Issue: 1, 24 - 32, 27.06.2022
https://doi.org/10.54525/tbbmd.1051177

Abstract

Teknolojinin gün geçtikçe gelişmesiyle birlikte hayatımızdaki yeri ve önemi artmaktadır. Gelişen teknoloji, birçok cihazın birbirleriyle ve insanlarla olan etkileşimini arttırmıştır. Bu etkileşimin sonucunda ortaya büyük miktarda veri çıkmaktadır. Gerçek zamanlı üretilen bu veriler, üretildiği anda değerlidir. Özellikleri gereği sıralı, değişik boyutlarda ve düzensiz periyotlarda elde edilen bu veriler, akan veri olarak tanımlanmıştır. Akan veriler, hemen işlenmezse değerini kaybedebilir veya tamamen kaybolabilir. Bu nedenle, yapılandırılmamış verileri sürekli olarak alıp analiz edebilen ölçeklenebilir sistemlerin geliştirilmesi önemlidir. Literatürdeki çalışmaların çoğu mevcut şartlarda sistemin nasıl çalışacağı konusuna yoğunlaşmıştır.
Bu çalışma kapsamında, yukarıdaki problemlerden yola çıkarak, akan veriyi makine öğrenme algoritmaları kullanılarak anlık olarak analiz edebilen ölçeklenebilir bir sistem tasarımı amaçlanmıştır. Geliştirilen sistem ve algoritmalar, gerçek veri ve yapay veriler ile çalıştırılarak değerlendirme metrikleriyle sonuçlar elde edilmiş, ölçeklenme durumu anlık olarak izlenmiştir. Yapılan simülasyon çalışması sonucundaki veriler değerlendirilerek literatüre ve gelecek çalışmalara ışık tutmak amaçlanmıştır.

References

  • Nittel S., 2015, Real-time Sensor Data Streams, Sigspatial Special, 7(2).
  • Kolajo T., Daramola O. and Adebiyi A., 2019, Big Data Stream Analysis: A Systematic Literature Review, Journal of Big Data, 6(1), pp. 47.
  • Krishnaswamy S., Gaber M. M. and Zaslavsky A., 2005, Mining Dat Streams: A Review, ACM Sigmod Record, 34(2), pp. 18-26.
  • Jing G., Clay W., Jiawei K., Nikunj C., Mohamad M., Latifur K. and Kevin W, 2011, Facing the Reality of Data Stream Classification: Coping with Scarcity of Labeled Data, Knowledge and Information Systems, 33, pp. 213-214.
  • Bifet A., Holmes G., Kirkby R. and Pfahringer B., 2011, Data Stream Mining a Practical Approach, https://moa.cms.waikato.ac.nz/downloads/, [Ziyaret tarihi: 15 Kasım 2020].
  • Lindburg K., Stern R., Buddhika T., Pallicara S. and Ericson K., 2017, Online Scheduling and Interface Alleviation for Low-Latency, High-Troughput Processing of Data Streams, IEEE Transactions on Parallel and Distributed Systems, 28(12), pp. 3553-3569.
  • Meng X., Wang C., Guo Q., Weng Z. and Yang C., 2017, Automating Characterization Deployment in Distributed Data Stream Management Systems, IEEE Transactions on Knowledge and Data Engineering, 29(12), pp. 2669 - 2681.
  • Liu X. and Buyya R., 2019, Performance-oriented deployment of streaming applications on cloud, IEEE Tr. on Big Data, 5(1), pp. 46-59.
  • Li C., Zhang J., Zhu L. and Liu Y., 2016, The Real-time Scheduling Strategy Based on Traffic and Load Balancing in Storm, IEEE 18th International Conference on High Performance Computing and Communications, pp. 372-279.
  • Hulten G. and Domingos P., 2000, Mining high-speed data streams, In : Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 71-80.
  • Li T., Xu J. and Tang J., 2015, A Predictive Scheduling Framework for Fast and Distributed Stream Data Processing, IEEE International Conference on Big Data, pp. 333-338.
  • Sheikholeslami G., Chatterjee S. and Zhang A., 2000, WaveCluster: A Wavelet Based Clustering Approach for Spatial Data in Very Large Databases, The VLDB Journal, 8(3), pp. 289-304.
  • Vassilvitskii S., Arthur D., 2007, k-means++: The advantages of careful seeding. In Proceedings of the eighteenth ACMSIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, pp. 1027-1035.
  • Assent I., Kranen P., Baldauf C. and Seidl T., 2011, The ClusTree: Indexing micro-clusters for anytime stream mining, Knowledge and Information Systems, 29(2), pp. 249-272.
  • Ester M., Cao F., Qian W. and Zhou A., 2006, Density-Based Clustering over an Evolving Data Stream with Noise, in Proceedings of the 2006 SIAM International Conference on Data Mining.
  • [Singh A., 2017, An Efficient Hybrid- Clustream Algorithm for Stream Mining, 13th International Conference on Signal-Image Technology and Internet-Based Systems, pp. 430-436.
  • Merino, J. A., 2015, Streaming Data Clustering in MOA using the Leader Algorithm, (Yüksek Lisans tezi, Universitat Politecnica De Catalunya, Facultat D’informatica De Barcelona).
  • Machine Learning Repository, https://archive.ics.uci.edu/ml/datasets/covertye, Son erişim tarihi: 14.03.2022
Year 2022, Volume: 15 Issue: 1, 24 - 32, 27.06.2022
https://doi.org/10.54525/tbbmd.1051177

Abstract

References

  • Nittel S., 2015, Real-time Sensor Data Streams, Sigspatial Special, 7(2).
  • Kolajo T., Daramola O. and Adebiyi A., 2019, Big Data Stream Analysis: A Systematic Literature Review, Journal of Big Data, 6(1), pp. 47.
  • Krishnaswamy S., Gaber M. M. and Zaslavsky A., 2005, Mining Dat Streams: A Review, ACM Sigmod Record, 34(2), pp. 18-26.
  • Jing G., Clay W., Jiawei K., Nikunj C., Mohamad M., Latifur K. and Kevin W, 2011, Facing the Reality of Data Stream Classification: Coping with Scarcity of Labeled Data, Knowledge and Information Systems, 33, pp. 213-214.
  • Bifet A., Holmes G., Kirkby R. and Pfahringer B., 2011, Data Stream Mining a Practical Approach, https://moa.cms.waikato.ac.nz/downloads/, [Ziyaret tarihi: 15 Kasım 2020].
  • Lindburg K., Stern R., Buddhika T., Pallicara S. and Ericson K., 2017, Online Scheduling and Interface Alleviation for Low-Latency, High-Troughput Processing of Data Streams, IEEE Transactions on Parallel and Distributed Systems, 28(12), pp. 3553-3569.
  • Meng X., Wang C., Guo Q., Weng Z. and Yang C., 2017, Automating Characterization Deployment in Distributed Data Stream Management Systems, IEEE Transactions on Knowledge and Data Engineering, 29(12), pp. 2669 - 2681.
  • Liu X. and Buyya R., 2019, Performance-oriented deployment of streaming applications on cloud, IEEE Tr. on Big Data, 5(1), pp. 46-59.
  • Li C., Zhang J., Zhu L. and Liu Y., 2016, The Real-time Scheduling Strategy Based on Traffic and Load Balancing in Storm, IEEE 18th International Conference on High Performance Computing and Communications, pp. 372-279.
  • Hulten G. and Domingos P., 2000, Mining high-speed data streams, In : Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 71-80.
  • Li T., Xu J. and Tang J., 2015, A Predictive Scheduling Framework for Fast and Distributed Stream Data Processing, IEEE International Conference on Big Data, pp. 333-338.
  • Sheikholeslami G., Chatterjee S. and Zhang A., 2000, WaveCluster: A Wavelet Based Clustering Approach for Spatial Data in Very Large Databases, The VLDB Journal, 8(3), pp. 289-304.
  • Vassilvitskii S., Arthur D., 2007, k-means++: The advantages of careful seeding. In Proceedings of the eighteenth ACMSIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, pp. 1027-1035.
  • Assent I., Kranen P., Baldauf C. and Seidl T., 2011, The ClusTree: Indexing micro-clusters for anytime stream mining, Knowledge and Information Systems, 29(2), pp. 249-272.
  • Ester M., Cao F., Qian W. and Zhou A., 2006, Density-Based Clustering over an Evolving Data Stream with Noise, in Proceedings of the 2006 SIAM International Conference on Data Mining.
  • [Singh A., 2017, An Efficient Hybrid- Clustream Algorithm for Stream Mining, 13th International Conference on Signal-Image Technology and Internet-Based Systems, pp. 430-436.
  • Merino, J. A., 2015, Streaming Data Clustering in MOA using the Leader Algorithm, (Yüksek Lisans tezi, Universitat Politecnica De Catalunya, Facultat D’informatica De Barcelona).
  • Machine Learning Repository, https://archive.ics.uci.edu/ml/datasets/covertye, Son erişim tarihi: 14.03.2022
There are 18 citations in total.

Details

Primary Language Turkish
Subjects Engineering
Journal Section Makaleler(Araştırma)
Authors

Önder Aykurt 0000-0003-4949-8658

Zeynep Orman 0000-0002-0205-4198

Early Pub Date June 27, 2022
Publication Date June 27, 2022
Published in Issue Year 2022 Volume: 15 Issue: 1

Cite

APA Aykurt, Ö., & Orman, Z. (2022). Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi, 15(1), 24-32. https://doi.org/10.54525/tbbmd.1051177
AMA Aykurt Ö, Orman Z. Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi. TBV-BBMD. June 2022;15(1):24-32. doi:10.54525/tbbmd.1051177
Chicago Aykurt, Önder, and Zeynep Orman. “Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi 15, no. 1 (June 2022): 24-32. https://doi.org/10.54525/tbbmd.1051177.
EndNote Aykurt Ö, Orman Z (June 1, 2022) Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi. Türkiye Bilişim Vakfı Bilgisayar Bilimleri ve Mühendisliği Dergisi 15 1 24–32.
IEEE Ö. Aykurt and Z. Orman, “Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi”, TBV-BBMD, vol. 15, no. 1, pp. 24–32, 2022, doi: 10.54525/tbbmd.1051177.
ISNAD Aykurt, Önder - Orman, Zeynep. “Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri ve Mühendisliği Dergisi 15/1 (June 2022), 24-32. https://doi.org/10.54525/tbbmd.1051177.
JAMA Aykurt Ö, Orman Z. Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi. TBV-BBMD. 2022;15:24–32.
MLA Aykurt, Önder and Zeynep Orman. “Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi”. Türkiye Bilişim Vakfı Bilgisayar Bilimleri Ve Mühendisliği Dergisi, vol. 15, no. 1, 2022, pp. 24-32, doi:10.54525/tbbmd.1051177.
Vancouver Aykurt Ö, Orman Z. Akan Verinin Makine Öğrenme Algoritmaları Kullanılarak Ölçeklenmesi. TBV-BBMD. 2022;15(1):24-32.

Article Acceptance

Use user registration/login to upload articles online.

The acceptance process of the articles sent to the journal consists of the following stages:

1. Each submitted article is sent to at least two referees at the first stage.

2. Referee appointments are made by the journal editors. There are approximately 200 referees in the referee pool of the journal and these referees are classified according to their areas of interest. Each referee is sent an article on the subject he is interested in. The selection of the arbitrator is done in a way that does not cause any conflict of interest.

3. In the articles sent to the referees, the names of the authors are closed.

4. Referees are explained how to evaluate an article and are asked to fill in the evaluation form shown below.

5. The articles in which two referees give positive opinion are subjected to similarity review by the editors. The similarity in the articles is expected to be less than 25%.

6. A paper that has passed all stages is reviewed by the editor in terms of language and presentation, and necessary corrections and improvements are made. If necessary, the authors are notified of the situation.

0

.   This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.