Research Article

Novel Approach for Detecting the Number of Columns of a Résumé

Volume: 12 Number: 1 March 26, 2025
EN

Novel Approach for Detecting the Number of Columns of a Résumé

Abstract

In recruitment processes, manually reviewing résumés is a highly time-consuming job. In order to reduce the cost of these reviews, Information Extraction tasks have been introduced to extract the structure of the document and the personal information contained within. However, because there is no consensus on a standard structure of résumés, i.e., each résumé has its own distinctive layout, column numbers, or text properties, an accurate extraction process becomes highly challenging. This study addresses a part of this problem. We focus on the problem of estimating the number of columns in résumés, as we experience in the further processes that knowing the number of columns facilitates the separation of the main sections of the résumés, hence the analysis of the finer subsections. We employ the coordinates of the text blocks that build up a résumé. We hypothesize that the coordinates of the text blocks carry information on the number of columns. We define the problem in a clustering context. We proposed a novel clustering approaches dedicated to finding the number of columns in a résumé by the separation of the text block coordinates. The experiments are conducted on a dataset of the résumés of real applicants in two languages: Turkish and English. The results reveal that hybrid approaches that use the intermediate methods perform better than the individual methods. Furthermore, these findings could be extended to any unstructured textual data in any language and document format

Keywords

References

  1. Adnan, K., & Akbar, R. (2019). An analytical study of information extraction from unstructured and multidimensional big data. Journal of Big Data, 6(1), 1-38. https://doi.org/10.1186/s40537-019-0254-8
  2. Alamelu, M., Kumar, D. S., Sanjana, R., Sree, J. S., Devi, A. S., & Kavitha, D. (2021, December). Resume validation and filtration using natural language processing. In 2021 10th International conference on internet of everything, microwave engineering, communication and networks (IEMECON) (pp. 1-5). IEEE. https://doi.org/10.1109/IEMECON53809.2021.9689075
  3. Chen, K., Yin, F., & Liu, C. L. (2013, August). Hybrid page segmentation with efficient whitespace rectangles extraction and grouping. In 2013 12th International Conference on Document Analysis and Recognition (pp. 958-962). IEEE. https://doi.org/10.1109/ICDAR.2013.194
  4. Chen, J., Zhang, C., & Niu, Z. (2018). A Two‐Step Resume Information Extraction Algorithm. Mathematical Problems in Engineering, 2018(1), 5761287. https://doi.org/10.1155/2018/5761287
  5. Çelik, D., & Elçi, A. (2012). An ontology-based information extraction approach for résumés. In Proceedings of the 2012 International Conference on Pervasive Computing and the Networked World ICPCA/SWS’12 (p. 165–179). Berlin, Heidelberg: Springer-Verlag. https://doi.org/10.1007/978-3-642-37015-1_14
  6. Das, P., Pandey, M., & Rautaray, S. S. (2018). A CV parser model using entity extraction process and big data tools. International Journal of Information Technology and Computer Science, 9(2), 21-31. https://doi.org/10.5815/ijitcs.2018.09.03
  7. Ester, M., Kriegel, H. P., Sander, J., & Xu, X. (1996, August). A density-based algorithm for discovering clusters in large spatial databases with noise. In kdd (Vol. 96, No. 34, pp. 226-231).
  8. Farkas, R., Dobó, A., Kurai, Z., Miklós, I., Nagy, Á., Vincze, V., & Zsibrita, J. (2014). Information extraction from Hungarian, English and German CVs for a career portal. In Mining Intelligence and Knowledge Exploration: Second International Conference, MIKE 2014, Cork, Ireland, December 10-12, 2014. Proceedings (pp. 333-341). Springer International Publishing. https://doi.org/10.1007/978-3-319-13817-6_32

Details

Primary Language

English

Subjects

Computing Applications in Life Sciences

Journal Section

Research Article

Publication Date

March 26, 2025

Submission Date

February 10, 2025

Acceptance Date

March 12, 2025

Published in Issue

Year 2025 Volume: 12 Number: 1

APA
Bali, Y., Orman, G. K., & Turhan, S. N. (2025). Novel Approach for Detecting the Number of Columns of a Résumé. Gazi University Journal of Science Part A: Engineering and Innovation, 12(1), 127-153. https://doi.org/10.54287/gujsa.1636051
AMA
1.Bali Y, Orman GK, Turhan SN. Novel Approach for Detecting the Number of Columns of a Résumé. GU J Sci, Part A. 2025;12(1):127-153. doi:10.54287/gujsa.1636051
Chicago
Bali, Yavuz, Günce Keziban Orman, and Sultan Nezihe Turhan. 2025. “Novel Approach for Detecting the Number of Columns of a Résumé”. Gazi University Journal of Science Part A: Engineering and Innovation 12 (1): 127-53. https://doi.org/10.54287/gujsa.1636051.
EndNote
Bali Y, Orman GK, Turhan SN (March 1, 2025) Novel Approach for Detecting the Number of Columns of a Résumé. Gazi University Journal of Science Part A: Engineering and Innovation 12 1 127–153.
IEEE
[1]Y. Bali, G. K. Orman, and S. N. Turhan, “Novel Approach for Detecting the Number of Columns of a Résumé”, GU J Sci, Part A, vol. 12, no. 1, pp. 127–153, Mar. 2025, doi: 10.54287/gujsa.1636051.
ISNAD
Bali, Yavuz - Orman, Günce Keziban - Turhan, Sultan Nezihe. “Novel Approach for Detecting the Number of Columns of a Résumé”. Gazi University Journal of Science Part A: Engineering and Innovation 12/1 (March 1, 2025): 127-153. https://doi.org/10.54287/gujsa.1636051.
JAMA
1.Bali Y, Orman GK, Turhan SN. Novel Approach for Detecting the Number of Columns of a Résumé. GU J Sci, Part A. 2025;12:127–153.
MLA
Bali, Yavuz, et al. “Novel Approach for Detecting the Number of Columns of a Résumé”. Gazi University Journal of Science Part A: Engineering and Innovation, vol. 12, no. 1, Mar. 2025, pp. 127-53, doi:10.54287/gujsa.1636051.
Vancouver
1.Yavuz Bali, Günce Keziban Orman, Sultan Nezihe Turhan. Novel Approach for Detecting the Number of Columns of a Résumé. GU J Sci, Part A. 2025 Mar. 1;12(1):127-53. doi:10.54287/gujsa.1636051