Mining Frequent Patterns on Dynamic Weighted Databases

Duy Ham Nguyen1, , Viet Anh Ngo2
1 Đại học Sài Gòn
2 Trường Đại học Sài Gòn, Việt Nam

Main Article Content

Abstract

Pattern mining is a fundamental task in modern data mining. Among its various branches, mining frequent weighted itemsets (FWIs) from quantitative databases stands as a significant problem, aiming to discover prevalent patterns within such datasets. However, current research has not adequately addressed quantitative databases where item weights dynamically change over time, known as dynamic weighted databases (dWDB). In real-world scenarios, the importance of items, such as product profitability or relevance at specific times, often fluctuates (e.g., air conditioners sell more in summer, and medical masks gain significance during respiratory disease outbreaks). This paper introduces a novel problem: mining FWIs with dynamic weighted items from quantitative databases, termed dynamic quantitative databases. We then propose dFWIT, an algorithm that leverages the traditional tidset structure to tackle this problem. Furthermore, we develop dFWNL, another algorithm that utilizes a new data structure called dWNList for mining FWIs from dWDB. Finally, we conduct experiments on various dWDBs to demonstrate the efficacy of the proposed algorithms

Article Details

References

Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, 207–216.
Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules. Proceedings of the 20th International Conference on Very Large Data Bases (VLDB), 487–499.
Bui, H., Vo, B., Nguyen, H., Nguyen-Hoang, T. A., & Hong, T. P. (2018). A weighted N-list-based method for mining frequent weighted itemsets. Expert Systems with Applications, 96, 388–405.
Bui, H., Vo, B., Nguyen-Hoang, T. A., & Yun, U. (2020). Mining frequent weighted closed itemsets using the WN-list structure and an early pruning strategy. Applied Intelligence, 51, 1439–1459.
Deng, Z., Wang, Z., & Jiang, J. (2012). A new algorithm for fast mining frequent itemsets using N-lists. Science China Information Sciences, 55(9), 2008–2030.
Han, J., Pei, J., & Yin, Y. (2000). Mining frequent patterns without candidate generation. Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 1–12.
Lee, G., Yun, U., & Ryu, K. (2017). Mining frequent weighted itemsets without storing transaction IDs and generating candidates. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 25(1), 111–144.
Nguyen, H., Le, T., Nguyen, M., Fournier-Viger, P., Tseng, V. S., & Vo, B. (2022). Mining frequent weighted utility itemsets in hierarchical quantitative databases. Knowledge-Based Systems, 237, 107709.
Nguyen, H., Vo, B., Nguyen, M., & Pedrycz, W. (2016). An efficient algorithm for mining frequent weighted itemsets using interval word segments. Applied Intelligence, 45(4), 1008–1020.
Nguyen, H., Vo, B., Nguyen, M. T. H., & Hong, T. (2015). An improved algorithm for mining frequent weighted itemsets. 2015 IEEE International Conference on Systems, Man, and Cybernetics, 2544–2549.
Tao, F., Murtagh, F., & Farid, M. (2003). Weighted association rule mining using weighted support and significance framework. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 661–666.
Vo, B., Bui, H., Vo, T., & Le, T. (2020). Mining top-rank-k frequent weighted itemsets using WN-list structures and an early pruning strategy. Knowledge-Based Systems, 201–202, 106064.
Vo, B., Coenen, F., & Le, B. (2013). A new method for mining frequent weighted itemsets based on WIT-Tree. Expert Systems with Applications, 40(4), 1256–1264.
Vo, B., Le, T., Coenen, F., & Hong, T. (2016). Mining frequent itemsets using the N-list and subsume concepts. International Journal of Machine Learning and Cybernetics, 7(2), 253–265.
Vo, B., Le, T., Hong, T., & Le, B. (2014). An effective approach for maintenance of pre-large-based frequent-itemset lattice in incremental mining. Applied Intelligence, 41(3), 759–775.
Zaki, M. J. (2000). Scalable algorithms for association mining. IEEE Transactions on Knowledge and Data Engineering, 12(3), 372–390.