High-occupancy itemset mining aims to identify itemsets within databases whose occupancy values satisfy a specified minimum threshold set by the user. However, selecting a suitable threshold can be difficult for users. If the threshold is set too low, it can result in too many itemsets, causing inefficiencies in terms of time and memory usage during the mining process and making it harder for decision-makers to interpret the results. On the other hand, setting the threshold too high may lead to the omission of valuable itemsets. To overcome this limitation, this paper extends the classical high-occupancy itemset mining problem into the top-k high-occupancy itemset mining problem and proposes an algorithm called TKHOIM (top-k high-occupancy itemset miner) that applies three strategies to address the problem efficiently. In this approach, users can directly specify the number of itemsets to be discovered, denoted as k, without the need to define a minimum occupancy threshold. Experimental results demonstrate that TKHOIM is effective in discovering the top-k high-occupancy itemsets.
Ethics committee approval was not required for this study because of there was no study on animals or humans.
High-occupancy itemset mining aims to identify itemsets within databases whose occupancy values satisfy a specified minimum threshold set by the user. However, selecting a suitable threshold can be difficult for users. If the threshold is set too low, it can result in too many itemsets, causing inefficiencies in terms of time and memory usage during the mining process and making it harder for decision-makers to interpret the results. On the other hand, setting the threshold too high may lead to the omission of valuable itemsets. To overcome this limitation, this paper extends the classical high-occupancy itemset mining problem into the top-k high-occupancy itemset mining problem and proposes an algorithm called TKHOIM (top-k high-occupancy itemset miner) that applies three strategies to address the problem efficiently. In this approach, users can directly specify the number of itemsets to be discovered, denoted as k, without the need to define a minimum occupancy threshold. Experimental results demonstrate that TKHOIM is effective in discovering the top-k high-occupancy itemsets.
Ethics committee approval was not required for this study because of there was no study on animals or humans.
| Birincil Dil | İngilizce |
|---|---|
| Konular | Bilgi Sistemleri Geliştirme Metodolojileri ve Uygulamaları, Karar Desteği ve Grup Destek Sistemleri |
| Bölüm | Research Articles |
| Yazarlar | |
| Erken Görünüm Tarihi | 12 Kasım 2025 |
| Yayımlanma Tarihi | 15 Kasım 2025 |
| Gönderilme Tarihi | 16 Temmuz 2025 |
| Kabul Tarihi | 17 Eylül 2025 |
| Yayımlandığı Sayı | Yıl 2025 Cilt: 8 Sayı: 6 |