Research Article

Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms

Volume: 8 Number: 3 October 1, 2020
EN

Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms

Abstract

Text mining studies on job ads have become widespread in recent years to determine the qualifications required for each position. It can be said that the researches made for Turkish are limited while a large resource pool is encountered for the English language. Kariyer.Net is the biggest company for the job ads in Turkey and 99% of the ads are Turkish. Therefore, there is a necessity to develop novel Natural Language Processing (NLP) models in Turkish for analysis of this big database. In this study, the job ads of Kariyer.Net have been analyzed, and by using a hybrid clustering algorithm, the hidden associations in this dataset as the big data have been discovered. Firstly, all ads in the form of HTML codes have been transformed into regular sentences by the means of extracting HTML codes to inner texts. Then, these inner texts containing the core ads have been converted into the sub ads by traditional methods. After these NLP steps, hybrid clustering algorithms have been used and the same ads expressed with the different sentences could be managed to be detected. For the analysis, 57 positions about Information Technology sectors with 6,897 ad texts have been focused on. As a result, it can be claimed that the clusters obtained contain useful outcomes and the model proposed can be used to discover common and unique ads for each position.

Keywords

Supporting Institution

Kariyer.Net R&D Center

Thanks

This study is supported by Kariyer.Net R&D Center. In addition, we thank the research team for the dataset.

References

  1. R. Loth, D. Battistelli, F. R. Chaumartin, H. De Mazancourt, J. L. Minel, and A. Vinckx, “Linguistic information extraction for job ads (SIRE project),” In 9th RIAO: Adaptivity, Personalization and Fusion of Heterogeneous Information, 2010, pp. 222-224.
  2. J. L. F. D. M. Pombo, “Landing on the right job: a machine learning approach to match candidates with jobs applying semantic embeddings,” Doctoral dissertation, 2019.
  3. J. Grüger, and G. J. Schneider, “Automated analysis of job requirements for computer scientists in online job advertisements,” in 15th International Conference on Web Information Systems and Technologies, 2019, pp 226-233.
  4. M. A. Kennan, P. Willard, P., D. C. Kecmanovic, and C. S. Wilson, “25. IS early career job advertisements: A content analysis,” in 11th Pacific-Asia Conference on Information Systems, New Zealand, 2007, pp. 340-353.
  5. Y. Choi, and E. Rasmussen, “What qualifications and skills are important for digital librarian positions in academic libraries? A job advertisement analysis,” The Journal of Academic Librarianship, vol. 35, no. 5, pp. 457–467, 2009.
  6. M. Pember, “Content analysis of recordkeeping job advertisements in Western Australia: Knowledge and skills required by employers,” Australian Academic & Research Libraries, vol. 34, no 3, pp. 194-210, 2003.
  7. D. C. Angelides. “From the present to the future of civil engineering education in Europe: A strategic approach,” in Proceedings of the International Meeting in Civil Engineering Education, Ciudad Real, Spain, 2003, pp. 1-21.
  8. C. Kwon Lee, and H. Han, “Analysis of skills requirement for entry-level programmer/analysts in fortune 500 corporations,” Journal of Information Systems Education, vol. 19, no. 1, pp. 17-27, 2008.

Details

Primary Language

English

Subjects

Engineering

Journal Section

Research Article

Publication Date

October 1, 2020

Submission Date

September 21, 2020

Acceptance Date

September 30, 2020

Published in Issue

Year 1970 Volume: 8 Number: 3

APA
Doğan, Y., Dalkılıç, F., Kut, R. A., Kara, K. C., & Takazoğlu, U. (2020). Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms. International Journal of Applied Mathematics Electronics and Computers, 8(3), 76-84. https://doi.org/10.18100/ijamec.797572
AMA
1.Doğan Y, Dalkılıç F, Kut RA, Kara KC, Takazoğlu U. Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms. International Journal of Applied Mathematics Electronics and Computers. 2020;8(3):76-84. doi:10.18100/ijamec.797572
Chicago
Doğan, Yunus, Feriştah Dalkılıç, Recep Alp Kut, Kemal Can Kara, and Uygar Takazoğlu. 2020. “Discovering the Same Job Ads Expressed With the Different Sentences by Using Hybrid Clustering Algorithms”. International Journal of Applied Mathematics Electronics and Computers 8 (3): 76-84. https://doi.org/10.18100/ijamec.797572.
EndNote
Doğan Y, Dalkılıç F, Kut RA, Kara KC, Takazoğlu U (October 1, 2020) Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms. International Journal of Applied Mathematics Electronics and Computers 8 3 76–84.
IEEE
[1]Y. Doğan, F. Dalkılıç, R. A. Kut, K. C. Kara, and U. Takazoğlu, “Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms”, International Journal of Applied Mathematics Electronics and Computers, vol. 8, no. 3, pp. 76–84, Oct. 2020, doi: 10.18100/ijamec.797572.
ISNAD
Doğan, Yunus - Dalkılıç, Feriştah - Kut, Recep Alp - Kara, Kemal Can - Takazoğlu, Uygar. “Discovering the Same Job Ads Expressed With the Different Sentences by Using Hybrid Clustering Algorithms”. International Journal of Applied Mathematics Electronics and Computers 8/3 (October 1, 2020): 76-84. https://doi.org/10.18100/ijamec.797572.
JAMA
1.Doğan Y, Dalkılıç F, Kut RA, Kara KC, Takazoğlu U. Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms. International Journal of Applied Mathematics Electronics and Computers. 2020;8:76–84.
MLA
Doğan, Yunus, et al. “Discovering the Same Job Ads Expressed With the Different Sentences by Using Hybrid Clustering Algorithms”. International Journal of Applied Mathematics Electronics and Computers, vol. 8, no. 3, Oct. 2020, pp. 76-84, doi:10.18100/ijamec.797572.
Vancouver
1.Yunus Doğan, Feriştah Dalkılıç, Recep Alp Kut, Kemal Can Kara, Uygar Takazoğlu. Discovering the same job ads expressed with the different sentences by using hybrid clustering algorithms. International Journal of Applied Mathematics Electronics and Computers. 2020 Oct. 1;8(3):76-84. doi:10.18100/ijamec.797572