Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams


Younghee Kim, Wonyoung Kim, Ungmo Kim, Journal of Information Processing Systems Vol. 6, No. 1, pp. 79-90, Mar. 2010  

DOI: 10.3745/JIPS.2010.6.1.079
Keywords: Frequent Itemsets, Weighted Support, Window Sliding, Weighted Support FP-Tree, Data Stream, WSFI-Mine

Abstract

A data stream is a massive, unbounded sequence of data elements generated continuously at a rapid rate. Because the stream is continuous, knowledge discovery algorithms must make only one scan over it. Data mining over data streams should also support a flexible trade-off between processing time and mining accuracy. In many application areas, weighted frequent itemset mining has been suggested as a way to find important frequent itemsets by taking the weights of itemsets into account. In this paper, we present WSFI-Mine (Weighted Support Frequent Itemsets Mine), an efficient algorithm that mines frequent itemsets with normalized weight over data streams. We also propose a novel tree structure, the Weighted Support FP-Tree (WSFP-Tree), which stores crucial information about frequent itemsets in compressed form. Empirical results show that our algorithm outperforms comparative algorithms under the windowed streaming model.
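
To make the idea of weighted support under a sliding window concrete, the following Python sketch may help. It is not WSFI-Mine and does not build a WSFP-Tree; it is a brute-force illustration under assumptions that the abstract does not state. The assumptions are: item weights are min-max scaled into a fixed range, the weighted support of an itemset is its average normalized item weight multiplied by its relative support in the current window, and only itemsets of up to `max_len` items are counted. All function and parameter names here are hypothetical.

```python
from collections import Counter, deque
from itertools import combinations


def normalize_weights(raw_weights, low=0.1, high=1.0):
    """Min-max scale raw item weights into [low, high] (assumed scheme)."""
    smallest = min(raw_weights.values())
    largest = max(raw_weights.values())
    if largest == smallest:
        return {item: high for item in raw_weights}
    scale = (high - low) / (largest - smallest)
    return {item: low + (w - smallest) * scale for item, w in raw_weights.items()}


def weighted_frequent_itemsets(stream, weights, window_size, min_wsup, max_len=2):
    """Yield (position, {itemset: weighted support}) for every full window.

    Each transaction is read once. Counts are updated incrementally as
    transactions enter and leave the window (assumed count-based window).
    """
    window = deque()
    counts = Counter()

    def subsets(transaction):
        # Enumerate all itemsets of size 1..max_len in a transaction.
        items = sorted(set(transaction))
        for k in range(1, max_len + 1):
            yield from combinations(items, k)

    for position, transaction in enumerate(stream):
        window.append(transaction)
        counts.update(subsets(transaction))
        if len(window) > window_size:
            # Slide the window: forget the oldest transaction.
            expired = window.popleft()
            counts.subtract(subsets(expired))
        if len(window) == window_size:
            result = {}
            for itemset, count in counts.items():
                if count <= 0:
                    continue
                avg_weight = sum(weights[i] for i in itemset) / len(itemset)
                wsup = avg_weight * count / window_size
                if wsup >= min_wsup:
                    result[itemset] = wsup
            yield position, result
```

Called with a small list of transactions, a weight table from `normalize_weights`, and a window of a few transactions, the generator reports one result per window position. A tree-based structure such as the WSFP-Tree described in the abstract would avoid the explicit subset enumeration used here, which grows quickly with transaction length.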


Cite this article
[APA Style]
Kim, Y., Kim, W., & Kim, U. (2010). Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams. Journal of Information Processing Systems, 6(1), 79-90. DOI: 10.3745/JIPS.2010.6.1.079.

[IEEE Style]
Y. Kim, W. Kim, and U. Kim, "Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams," Journal of Information Processing Systems, vol. 6, no. 1, pp. 79-90, 2010. DOI: 10.3745/JIPS.2010.6.1.079.

[ACM Style]
Younghee Kim, Wonyoung Kim, and Ungmo Kim. 2010. Mining Frequent Itemsets with Normalized Weight in Continuous Data Streams. Journal of Information Processing Systems, 6, 1, (2010), 79-90. DOI: 10.3745/JIPS.2010.6.1.079.