Research Article

Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts

Volume: 15 Number: 2 December 31, 2023

Abstract

Relation graphs are useful tools for structural and relational analysis of highly complex multi-component systems. Probabilistic relation graph models represent the relations between system components as probabilistic links, and they have been widely used for the graphical representation of Markov models and bigram probabilities. This study examines relational similarities within probabilistic graph models of textual entries and discusses several example uses of two fundamental similarity measures in the probabilistic analysis of short texts. To this end, the construction of probabilistic graph models from the bigram probability matrices of textual entries is illustrated, and vector spaces of input word-vectors and output word-vectors are formed. In these vector spaces, cosine similarity and mean squared error are used to evaluate the probabilistic relational similarity between lexeme pairs in short texts. Using the probabilistic relation graphs of the short texts, relational interchangeability analyses of lexeme pairs are conducted, and confidence index parameters are defined to express the reliability of these analyses. Potential applications of these graphs in language processing and linguistics are discussed on the basis of the analysis results for example texts. The performance of the applied similarity measures is evaluated against the similarity index of the word2vec language model. In one of the illustrative examples, the comparison shows that a pair of synonyms with a word2vec similarity of 0.18157 obtained a cosine similarity of 1.0 under the proposed method.
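The pipeline summarized in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the toy sentence, the whitespace tokenization, the row normalization of bigram counts, and the reading of output word-vectors as matrix rows and input word-vectors as matrix columns are all assumptions made here. The function names are hypothetical.

```python
from collections import Counter
import math


def bigram_matrix(tokens):
    """Return (vocab, P), where P[i][j] estimates P(next = vocab[j] | current = vocab[i])."""
    vocab = list(dict.fromkeys(tokens))          # unique words, first-appearance order
    index = {w: i for i, w in enumerate(vocab)}
    counts = Counter(zip(tokens, tokens[1:]))    # counts of adjacent word pairs
    n = len(vocab)
    P = [[0.0] * n for _ in range(n)]
    for (a, b), c in counts.items():
        P[index[a]][index[b]] = float(c)
    for row in P:                                # normalize each row to sum to 1
        total = sum(row)
        if total > 0:
            for j in range(n):
                row[j] /= total
    return vocab, P


def output_vector(P, i):
    """Assumed output word-vector: probabilities of the words that follow word i."""
    return list(P[i])


def input_vector(P, i):
    """Assumed input word-vector: probabilities of reaching word i from each word."""
    return [row[i] for row in P]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0


def mse(u, v):
    """Mean squared error between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v)) / len(u)


# Toy short text (an assumption, not taken from the article).
tokens = "a big dog barks a large dog barks".split()
vocab, P = bigram_matrix(tokens)
i, j = vocab.index("big"), vocab.index("large")
print("output cosine:", cosine(output_vector(P, i), output_vector(P, j)))
print("output MSE:   ", mse(output_vector(P, i), output_vector(P, j)))
print("input cosine: ", cosine(input_vector(P, i), input_vector(P, j)))
```

In this toy text, "big" and "large" share the same predecessor and the same successor, so their input and output vectors coincide: the cosine similarity is 1 and the mean squared error is 0. A pair with different successors, such as "big" and "dog", has an output cosine similarity of 0.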

Details

Primary Language

English

Subjects

Software Testing, Verification and Validation

Journal Section

Research Article

Publication Date

December 31, 2023

Submission Date

January 23, 2023

Acceptance Date

August 8, 2023

Published in Issue

Year 2023 Volume: 15 Number: 2

APA
Alnahas, D., Ateş, A., Aydın, A. A., & Alagöz, B. B. (2023). Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts. Turkish Journal of Mathematics and Computer Science, 15(2), 334-354. https://doi.org/10.47000/tjmcs.1240729
AMA
1. Alnahas D, Ateş A, Aydın AA, Alagöz BB. Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts. TJMCS. 2023;15(2):334-354. doi:10.47000/tjmcs.1240729
Chicago
Alnahas, Dima, Abdullah Ateş, Ahmet Arif Aydın, and Barış Baykant Alagöz. 2023. “Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts”. Turkish Journal of Mathematics and Computer Science 15 (2): 334-54. https://doi.org/10.47000/tjmcs.1240729.
EndNote
Alnahas D, Ateş A, Aydın AA, Alagöz BB (December 31, 2023) Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts. Turkish Journal of Mathematics and Computer Science 15 2 334–354.
IEEE
[1] D. Alnahas, A. Ateş, A. A. Aydın, and B. B. Alagöz, “Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts”, TJMCS, vol. 15, no. 2, pp. 334–354, Dec. 2023, doi: 10.47000/tjmcs.1240729.
ISNAD
Alnahas, Dima - Ateş, Abdullah - Aydın, Ahmet Arif - Alagöz, Barış Baykant. “Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts”. Turkish Journal of Mathematics and Computer Science 15/2 (December 31, 2023): 334-354. https://doi.org/10.47000/tjmcs.1240729.
JAMA
1. Alnahas D, Ateş A, Aydın AA, Alagöz BB. Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts. TJMCS. 2023;15:334–354.
MLA
Alnahas, Dima, et al. “Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts”. Turkish Journal of Mathematics and Computer Science, vol. 15, no. 2, Dec. 2023, pp. 334-54, doi:10.47000/tjmcs.1240729.
Vancouver
1. Dima Alnahas, Abdullah Ateş, Ahmet Arif Aydın, Barış Baykant Alagöz. Revisiting Probabilistic Relation Analysis: Using Probabilistic Relation Graphs for Relational Similarity Analysis of Words in Short Texts. TJMCS. 2023 Dec. 31;15(2):334-54. doi:10.47000/tjmcs.1240729
