Research Article

Log Analysis with Hadoop MapReduce

Volume: 1 Number: 1 June 30, 2021
  • Gligor Risteski
  • Mihiri Chathurika
  • Beyza Ali
  • Atanas Hristov *

Abstract

Nearly every aspect of life now generates data. Logs are records of events and system activities that IT systems create automatically, and log data analysis is the process of making sense of these records. Log data often grows quickly, and conventional database solutions fall short when dealing with large volumes of log files. Hadoop, which is widely used for Big Data analysis, provides a solution to this problem. In this study, Hadoop was installed on two virtual machines, and log files generated by a Python script were analyzed in order to evaluate system activities. The aim was to validate the importance of Hadoop in meeting the challenge of dealing with Big Data. The experiments show that analyzing logs with Hadoop MapReduce makes data processing and the detection of malfunctions and defects faster and simpler.
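
As a rough illustration of the map and reduce phases applied to log data, the following minimal Python sketch counts log entries per severity level in a single process. It is not the authors' implementation: the log line layout, the level names, and the function names (map_line, reduce_counts, count_levels) are all assumptions made for this example, and on a real cluster Hadoop would distribute both phases across nodes.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Tuple

# Assumed log line layout (hypothetical, not taken from the paper):
#   "<date> <time> <LEVEL> <message>"
LEVELS = {"INFO", "WARNING", "ERROR"}


def map_line(line: str) -> Iterator[Tuple[str, int]]:
    """Map phase: emit (level, 1) for each well-formed log line."""
    parts = line.split(maxsplit=3)
    if len(parts) >= 3 and parts[2] in LEVELS:
        yield parts[2], 1


def reduce_counts(pairs: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Reduce phase: sum the values emitted for each key."""
    totals: Dict[str, int] = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def count_levels(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, then reduce, locally; Hadoop would distribute both phases."""
    mapped = (pair for line in lines for pair in map_line(line))
    return reduce_counts(mapped)


# Hypothetical sample data standing in for script-generated log files.
SAMPLE_LOG = [
    "2021-06-30 12:00:00 INFO service started",
    "2021-06-30 12:00:05 ERROR disk failure",
    "2021-06-30 12:00:09 ERROR disk failure",
    "malformed line",
]

if __name__ == "__main__":
    print(count_levels(SAMPLE_LOG))
```

Lines that do not match the assumed layout are skipped in the map phase, so a single malformed record does not abort the count.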

Details

Primary Language

English

Subjects

Computer Software

Journal Section

Research Article

Authors

Gligor Risteski
Macedonia

Mihiri Chathurika
Macedonia

Beyza Ali
Macedonia

Atanas Hristov *
0000-0003-2741-8370
Macedonia

Publication Date

June 30, 2021

Submission Date

October 4, 2020

Acceptance Date

December 2, 2020

Published in Issue

Year 2021 Volume: 1 Number: 1

APA
Risteski, G., Chathurika, M., Ali, B., & Hristov, A. (2021). Log Analysis with Hadoop MapReduce. Journal of Emerging Computer Technologies, 1(1), 1-5. https://izlik.org/JA39EN76DZ
Journal of Emerging Computer Technologies is indexed and abstracted by Harvard Hollis, Scilit, ROAD, Google Scholar, and OpenAIRE.

Publisher
Izmir Academy Association
