EN
Using Word Embeddings for Ontology Enrichment
Abstract
Word embeddings, distributed word representations in a reduced linear space, show a lot of promise for accomplishing Natural Language Processing (NLP) tasks in an unsupervised manner. In this study, we investigate if the success of word2vec, a Neural Networks based word embeddings algorithm, can be replicated in an aggluginative language like Turkish. Turkish is more challenging than languages like English for complex NLP tasks because of her rich morphology. We picked ontology enrichment, again a relatively harder NLP task, as our test application. Firstly, we show how ontological relations can be extracted automaticaly from Turkish Wikipedia to construct a gold standard. Then by running experiments we show that the word vector representations produced by word2vec are useful to detect ontological relations encoded in Wikipedia. We propose a simple but yet effective weakly supervised ontology enrichment algorithm where for a given word a few know ontologically related concepts coupled with similarity scores computed via word2vec models can result in discovery of other related concepts. We argue how our algorithm can be improved and augmented to make it a viable component of an ontoloy learning and population framework.
Keywords
References
- Collobert R, Weston J, Bottou L, Karlen M, Kavukcuoglu K, Kuksa P. 2011. Natural language processing (almost) from scratch. The Journal of Machine Learning Research. 12:2493-537.
- Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient Estimation of Word Representations in Vector Space. In Proceedings of Workshop at ICLR.
- Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (pp. 3111-3119).
- Petasis G, Karkaletsis V, Paliouras G, Krithara A, Zavitsanos E. 2011. Ontology Population and Enrichment: State of the Art. In Knowledge-Driven Multimedia Information Extraction and Ontology Evolution (pp. 134-166). Springer-Verlag.
- Zouaq A, Gasevic D, Hatala M. 2011. Towards Open Ontology Learning and Filtering. Information Systems. 36(7):1064-81.
- Tanev H, Magnini B. 2008. Weakly supervised approaches for ontology population. In Proceeding of the 2008 conference on Ontology Learning and Population: Bridging the Gap between Text and Knowledge (pp. 129-143).
- Rong X. 2014. word2vec parameter learning explained. arXiv preprint arXiv:1411.2738.
- Pennington J, Socher R, Manning CD. 2014. Glove: Global Vectors for Word Representation. In EMNLP 2014 (Vol. 14, pp. 1532-1543).
Details
Primary Language
English
Subjects
-
Journal Section
-
Authors
İzzet Pembeci
This is me
Publication Date
November 1, 2016
Submission Date
June 19, 2016
Acceptance Date
-
Published in Issue
Year 1970 Volume: 4 Number: 3
APA
Pembeci, İ. (2016). Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering, 4(3), 49-56. https://doi.org/10.18201/ijisae.58806
AMA
1.Pembeci İ. Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering. 2016;4(3):49-56. doi:10.18201/ijisae.58806
Chicago
Pembeci, İzzet. 2016. “Using Word Embeddings for Ontology Enrichment”. International Journal of Intelligent Systems and Applications in Engineering 4 (3): 49-56. https://doi.org/10.18201/ijisae.58806.
EndNote
Pembeci İ (November 1, 2016) Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering 4 3 49–56.
IEEE
[1]İ. Pembeci, “Using Word Embeddings for Ontology Enrichment”, International Journal of Intelligent Systems and Applications in Engineering, vol. 4, no. 3, pp. 49–56, Nov. 2016, doi: 10.18201/ijisae.58806.
ISNAD
Pembeci, İzzet. “Using Word Embeddings for Ontology Enrichment”. International Journal of Intelligent Systems and Applications in Engineering 4/3 (November 1, 2016): 49-56. https://doi.org/10.18201/ijisae.58806.
JAMA
1.Pembeci İ. Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering. 2016;4:49–56.
MLA
Pembeci, İzzet. “Using Word Embeddings for Ontology Enrichment”. International Journal of Intelligent Systems and Applications in Engineering, vol. 4, no. 3, Nov. 2016, pp. 49-56, doi:10.18201/ijisae.58806.
Vancouver
1.İzzet Pembeci. Using Word Embeddings for Ontology Enrichment. International Journal of Intelligent Systems and Applications in Engineering. 2016 Nov. 1;4(3):49-56. doi:10.18201/ijisae.58806
Cited By
An integrated framework for automatic ontology learning from unstructured repair text data for effective fault detection and isolation in automotive domain
Computers in Industry
https://doi.org/10.1016/j.compind.2020.103338Quality assurance of biomedical terminologies and ontologies
Journal of Biomedical Informatics
https://doi.org/10.1016/j.jbi.2018.09.006Concept placement using BERT trained by transforming and summarizing biomedical ontology structure
Journal of Biomedical Informatics
https://doi.org/10.1016/j.jbi.2020.103607Automatic Structuring of Ontology Terms Based on Lexical Granularity and Machine Learning: Algorithm Development and Validation
JMIR Medical Informatics
https://doi.org/10.2196/22333HURON: A Quantitative Framework for Assessing Human Readability in Ontologies
IEEE Access
https://doi.org/10.1109/ACCESS.2023.3316512Ontology enrichment from opinions using machine learning algorithms
International Journal of Information Technology
https://doi.org/10.1007/s41870-024-01873-3