In this study, Internet users were
clustered by the search keywords which they type into search bars of search
engines. Our proposed software is called UQCS (User Queries Clustering System) and
it was developed to demonstrate the efficiency of our hypothesis. UQCS
co-operates with the Strehl’s relationship based clustering toolkit and
performs segmentation on users based on the keywords they use for searching the
web. Internet Proxy server logs were parsed and query strings were extracted
from the search engine URL’s and the resulting IP-Term matrix was converted
into a similarity matrix using Euclidean, Jaccard, Cosine Distance and Pearson
Correlation Distance metrics. K- Means and graph-based OPOSSUM algorithm were
used to perform clustering on the similarity matrices. Results were illustrated by using CLUSION
visualization toolkit.
Konular | Mühendislik |
---|---|
Bölüm | Makaleler |
Yazarlar | |
Yayımlanma Tarihi | 29 Aralık 2017 |
Yayımlandığı Sayı | Yıl 2017 Cilt: 13 Sayı: 4 |