Research Article

Mining Top-K High Occupancy Itemsets

Volume: 8 Number: 6 November 15, 2025
TR EN

Mining Top-K High Occupancy Itemsets

Abstract

High-occupancy itemset mining aims to identify itemsets within databases whose occupancy values satisfy a specified minimum threshold set by the user. However, selecting a suitable threshold can be difficult for users. If the threshold is set too low, it can result in too many itemsets, causing inefficiencies in terms of time and memory usage during the mining process and making it harder for decision-makers to interpret the results. On the other hand, setting the threshold too high may lead to the omission of valuable itemsets. To overcome this limitation, this paper extends the classical high-occupancy itemset mining problem into the top-k high-occupancy itemset mining problem and proposes an algorithm called TKHOIM (top-k high-occupancy itemset miner) that applies three strategies to address the problem efficiently. In this approach, users can directly specify the number of itemsets to be discovered, denoted as k, without the need to define a minimum occupancy threshold. Experimental results demonstrate that TKHOIM is effective in discovering the top-k high-occupancy itemsets.

Keywords

Ethical Statement

Ethics committee approval was not required for this study because of there was no study on animals or humans.

References

  1. Chen J, Yang S, Ding W, Li P, Liu A, Zhang H, Li T. 2024. Incremental high average-utility itemset mining: survey and challenges. Sci Rep, 14: 9924.
  2. Deng Z. 2013. Mining top‐rank‐k erasable itemsets by PID_lists. Int J Intell Syst, 28: 366-379.
  3. Deng ZH. 2020. Mining high occupancy itemsets. Future Gener Comput Syst, 102: 222-229.
  4. Hong TP, Huang WM, Lan GC, Chiang MC, Lin JCW. 2021. A bitmap approach for mining erasable itemsets. IEEE Access, 9: 106029-106038.
  5. Huynh B, Tung NT, Nguyen TD, Bui QT, Nguyen LT, Yun U, Vo B. 2024. An efficient strategy for mining high-efficiency itemsets in quantitative databases. Knowl Based Syst, 299: 112035.
  6. Kim H, Cho M, Nam H, Baek Y, Park S, Kim D, Vo B, Yun U. 2024. Advanced incremental erasable pattern mining from the time-sensitive data stream. Knowl Based Syst, 299: 112001.
  7. Kim H, Cho M, Park S, Kim D, Kim D, Yun U. 2025. Damped weighted erasable itemset mining with time sensitive dynamic environments. J Big Data, 12: 20.
  8. Kim H, Ryu T, Lee C, Kim H, Truong T, Fournier-Viger P, Pedrycz W, Yun U. 2022. Mining high occupancy patterns to analyze incremental data in intelligent systems. ISA Trans, 131: 460–475.

Details

Primary Language

English

Subjects

Information Systems Development Methodologies and Practice, Decision Support and Group Support Systems

Journal Section

Research Article

Early Pub Date

November 12, 2025

Publication Date

November 15, 2025

Submission Date

July 16, 2025

Acceptance Date

September 17, 2025

Published in Issue

Year 2025 Volume: 8 Number: 6

APA
Yıldırım, İ. (2025). Mining Top-K High Occupancy Itemsets. Black Sea Journal of Engineering and Science, 8(6), 1723-1730. https://doi.org/10.34248/bsengineering.1744061
AMA
1.Yıldırım İ. Mining Top-K High Occupancy Itemsets. BSJ Eng. Sci. 2025;8(6):1723-1730. doi:10.34248/bsengineering.1744061
Chicago
Yıldırım, İrfan. 2025. “Mining Top-K High Occupancy Itemsets”. Black Sea Journal of Engineering and Science 8 (6): 1723-30. https://doi.org/10.34248/bsengineering.1744061.
EndNote
Yıldırım İ (November 1, 2025) Mining Top-K High Occupancy Itemsets. Black Sea Journal of Engineering and Science 8 6 1723–1730.
IEEE
[1]İ. Yıldırım, “Mining Top-K High Occupancy Itemsets”, BSJ Eng. Sci., vol. 8, no. 6, pp. 1723–1730, Nov. 2025, doi: 10.34248/bsengineering.1744061.
ISNAD
Yıldırım, İrfan. “Mining Top-K High Occupancy Itemsets”. Black Sea Journal of Engineering and Science 8/6 (November 1, 2025): 1723-1730. https://doi.org/10.34248/bsengineering.1744061.
JAMA
1.Yıldırım İ. Mining Top-K High Occupancy Itemsets. BSJ Eng. Sci. 2025;8:1723–1730.
MLA
Yıldırım, İrfan. “Mining Top-K High Occupancy Itemsets”. Black Sea Journal of Engineering and Science, vol. 8, no. 6, Nov. 2025, pp. 1723-30, doi:10.34248/bsengineering.1744061.
Vancouver
1.İrfan Yıldırım. Mining Top-K High Occupancy Itemsets. BSJ Eng. Sci. 2025 Nov. 1;8(6):1723-30. doi:10.34248/bsengineering.1744061

                            24890