TY - JOUR TT - TOPIC MODEL IMPLEMENTATION TO FIND RELATED DOCUMENTS IN CORPORATE ARCHIVES IN REAL LIFE: “A CASE SCENARIO ON KNOWLEDGE RETRIEVAL” AU - Medeni, İhsan Tolga AU - Medeni, Tunç Durmuş PY - 2013 DA - June JF - International Journal of eBusiness and eGovernment Studies JO - IJEBEG PB - Sosyal Bilimler Araştırmaları Derneği WT - DergiPark SN - 2146-0744 SP - 98 EP - 107 VL - 5 IS - 1 KW - Topic Model KW - Knowledge Extraction KW - Latent Semantic Analysis (LSA) KW - Probabilistic Latent Semantic Analysis (pLSA) KW - Latent Dirichlet Allocation (LDA) N2 - Today’s organizations were mostly built over their documents. These documents are very crucial sources of knowledge. Even they know the existence of these documents, most of the time, it is nearly impossible to extract captive knowledge inside. In these conditions, organizations choose re-prepare same document again rather than finding proper documents in the archives. On the other hand, finding these documents would save precious time and decrease redundancy of the work. Topic model idea basically focuses on extraction of knowledge from these types of documents. In this study, our aim is to give a summary of Topic Model research and try to explain latest model concept over an imaginary case scenario CR - Blei, Ng, Jordan,(2003), “Latent Dirichlet Allocation”, Journal of. Machine. Learning. Vol..3, pp. 993–1022. CR - Davenport, Prusak, (2000), Working Knowledge:How Organizations Manage CR - What They Know, Boston, Harward Business School Press. Deerwester, Dumais, Furnas, Landauer, Harshman (1990), “Indexing by Latent CR - Semantic Analysis” Journal of the American Society for Information Science, , Vol.41,No.6,pp.391-407. Gethers, Poshyvanyk,(2010),“Using Relational Topic Models to Capture CR - Coupling among Classes in Object-Oriented Software Systems”, IEEE International Conference on Software Maintenance, 2010. CR - Girolami, Kabán, (2003), “On an equivalence between PLSI and LDA”, in: Proc. CR - Annu. ACM SIGIR Int. Conf. on Research and Development in Information Retrieval, Toronto, Ontario, Canada, , pp. 433–434. Griffiths, Steyvers, (2004) “Finding scientific topics”, Proc. Nat. Acad. Sci. Vol.101 No.1 , pp. 5228–5235. CR - Hofmann (1999), “Probabilistic latent semantic indexing”, in: Proc. 22nd Annu. CR - ACM SIGIR Int. Conf. on Research and Development in Information Retrieval, Berkeley, CA, USA, , pp. 50–57. Kakkonen, Myller, Sutinen, Timonen.(2008) “Comparison of Dimension CR - Reduction Methods for Automated Essay Grading”, Educational Technology & Society;Vol.11, No.3,pp.275-288. Linstead, Rigor, Bajracharya, Lopes, Baldi,(2007), “Mining concepts from code with probabilistic topic models”, in: Proc. 22nd IEEE/ACM Int. Conf. on Automated CR - Lukins, Kraft, Etzkorn.(2010),”Bug localization using latent Dirichlet allocation”. CR - Information and Software Technology Vol.52, No.9,pp.972-990. Poshyvanyk, Guéhéneuc, Marcus, G. Antoniol, Rajlich (2006), “Combining probabilistic ranking and latent semantic indexing for feature location”, in:Proc. th IEEE Int. Conf. on Program Comprehension, Athens, Greece, , pp. 137–148. CR - Steyvers, Griffiths, (2007), “Probabilistic topic models”, (in: Landauer, CR - McNamara, Dennis, Kintsch-Ed, Handbook of Latent Semantic Analysis, Lawrence Erlbaum Associates.. Tian, Revelle, Poshyvanyk.(2009), “Using Latent Dirichlet Allocation for CR - Automatic Categorization of Software”. 6th Ieee International Working Conference on Mining Software Repositories pp.163-166. Wei, Croft,(2006) “LDA-based document models for ad-hoc retrieval”, in: Proc. th Annu. Int. ACM SIGIR Conf. on Research & Development on Information CR - Retrieval, WA, USA , pp. 178–185. Zheng, McLean, Lu, (2006), “Identifying biological concepts from a protein- related corpus with a probabilistic topic model”. Bmc Bioinformatics Vol.7. CR - Park, Ramamohanarao,.(2009), “The Sensitivity of Latent Dirichlet Allocation for CR - Information Retrieval”. (In: Buntine, Grobelnik, Mladenić, Shawe-Taylor-Ed. ,Machine Learning and Knowledge Discovery in Databases): Springer Berlin Heidelberg, pp. 176-188. UR - https://dergipark.org.tr/en/pub/ijebeg/issue//275912 L1 - https://dergipark.org.tr/en/download/article-file/257109 ER -