Research Article

Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection

Volume: 12 Number: 3 September 30, 2023
EN

Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection

Abstract

In this research, we delve deep into the realm of Targeted Social Engineering Email Detection, presenting a novel approach that harnesses the power of Lambda Architecture (LA). Our innovative methodology strategically segments the BERT model into two distinct components: the embedding generator and the classification segment. This segmentation not only optimizes resource consumption but also improves system efficiency, making it a pioneering step in the field. Our empirical findings, derived from a rigorous comparison between the fastText and BERT models, underscore the superior performance of the latter. Specifically, The BERT model has high precision rates for identifying malicious and benign emails, with impressive recall values and F1 scores. Its overall accuracy rate was 0.9988, with a Matthews Correlation Coefficient value of 0.9978. In comparison, the fastText model showed lower precision rates. Leveraging principles reminiscent of the Lambda architecture, our study delves into the performance dynamics of data processing models. The Separated-BERT (Sep-BERT) model emerges as a robust contender, adept at managing both real-time (stream) and large-scale (batch) data processing. Compared to the traditional BERT, Sep-BERT showcased superior efficiency, with reduced memory and CPU consumption across diverse email sizes and ingestion rates. This efficiency, combined with rapid inference times, positions Sep-BERT as a scalable and cost-effective solution, aligning well with the demands of Lambda- inspired architectures. This study marks a significant step forward in the fields of big data and cybersecurity. By introducing a novel methodology and demonstrating its efficacy in detecting targeted social engineering emails, we not only advance the state of knowledge in these domains but also lay a robust foundation for future research endeavors, emphasizing the transformative potential of integrating advanced big data frameworks with machine learning models.

Keywords

References

  1. [1] A. Papanikolaou, A. Alevizopoulos, C. Ilioudis, K. Demertzis, and K. Rantos, “A blockchained automl network traffic analyzer to industrial cyber defense and protection,” Electronics, vol. 12, no. 6, 2023.
  2. [2] G. Manogaran, C. Thota, D. Lopez, and R. Sundarasekar, “Big data security intelligence for healthcare industry 4.0,” Cyberse- curity for Industry 4.0: Analysis for Design and Manufacturing, pp. 103–126, 2017.
  3. [3] A. Papanikolaou, A. Alevizopoulos, C. Ilioudis, K. Demertzis, and K. Rantos, “An automl network traffic analyzer for cyber threat detection,” International Journal of Information Security, pp. 1–20, 2023.
  4. [4] Y. Wang, W. Ma, H. Xu, Y. Liu, and P. Yin, “A lightweight multi-view learning approach for phishing attack detection using transformer with mixture of experts,” Applied Sciences, vol. 13, no. 13, 2023.
  5. [5] J. Ramprasath, S. Priyanka, R. Manudev, and M. Gokul, “Identification and mitigation of phishing email attacks using deep learning,” in 2023 3rd International Conference on Ad- vance Computing and Innovative Technologies in Engineering (ICACITE), 2023, pp. 466–470.
  6. [6] A. Mughaid, S. AlZu’bi, A. Hnaif, S. Taamneh, A. Alnajjar, and E. A. Elsoud, “An intelligent cyber security phishing detection system using deep learning techniques,” Cluster Computing, vol. 25, no. 6, pp. 3819–3828, 2022.
  7. [7] B. Naqvi, K. Perova, A. Farooq, I. Makhdoom, S. Oyedeji, and J. Porras, “Mitigation strategies against the phishing attacks: A systematic literature review,” Computers & Security, vol. 132, p. 103387, 2023.
  8. [8] T. Muralidharan and N. Nissim, “Improving malicious email detection through novel designated deep-learning architectures utilizing entire email,” Neural Networks, vol. 157, pp. 257–279, 2023.

Details

Primary Language

English

Subjects

Cybersecurity and Privacy (Other)

Journal Section

Research Article

Publication Date

September 30, 2023

Submission Date

August 7, 2023

Acceptance Date

September 25, 2023

Published in Issue

Year 2023 Volume: 12 Number: 3

APA
Demirezen, M. U., & Selcen Navruz, T. (2023). Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection. International Journal of Information Security Science, 12(3), 29-59. https://doi.org/10.55859/ijiss.1338813
AMA
1.Demirezen MU, Selcen Navruz T. Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection. IJISS. 2023;12(3):29-59. doi:10.55859/ijiss.1338813
Chicago
Demirezen, Mustafa Umut, and Tuğba Selcen Navruz. 2023. “Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection”. International Journal of Information Security Science 12 (3): 29-59. https://doi.org/10.55859/ijiss.1338813.
EndNote
Demirezen MU, Selcen Navruz T (September 1, 2023) Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection. International Journal of Information Security Science 12 3 29–59.
IEEE
[1]M. U. Demirezen and T. Selcen Navruz, “Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection”, IJISS, vol. 12, no. 3, pp. 29–59, Sept. 2023, doi: 10.55859/ijiss.1338813.
ISNAD
Demirezen, Mustafa Umut - Selcen Navruz, Tuğba. “Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection”. International Journal of Information Security Science 12/3 (September 1, 2023): 29-59. https://doi.org/10.55859/ijiss.1338813.
JAMA
1.Demirezen MU, Selcen Navruz T. Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection. IJISS. 2023;12:29–59.
MLA
Demirezen, Mustafa Umut, and Tuğba Selcen Navruz. “Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection”. International Journal of Information Security Science, vol. 12, no. 3, Sept. 2023, pp. 29-59, doi:10.55859/ijiss.1338813.
Vancouver
1.Mustafa Umut Demirezen, Tuğba Selcen Navruz. Lambda Architecture-Based Big Data System for Large-Scale Targeted Social Engineering Email Detection. IJISS. 2023 Sep. 1;12(3):29-5. doi:10.55859/ijiss.1338813