|
1. |
Adaptive Frequency Counting over Bursty Data Streams
Lin, B.; Wai-Shing Ho; Kao, B.; Chun-Kit Chui;
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
March 1 2007-April 5 2007
Page(s):516
-
523
Abstract:
We investigate the problem of frequent itemset mining over a data stream with bursty traffic. In many modern applications, data arrives at a system as a continuous stream of transactions. In many cases, the arrival rate of transactions fluctuates wildly. Traditional stream mining algorithms, such as Lossy Counting (LC), were generally designed to handle data streams with steady data arrival rates. We show that LC suffers significant loss of accuracy when the data stream is bursty. We propose the Adaptive Frequency Counting algorithm (AFC) to handle bursty data. AFC has a feedback mechanism that dynamically adjusts the mining speed to cope with the changing data arrival rate. Through extensive experiments, we show that AFC outperforms LC under bursty traffics in terms of the accuracy of the set of frequent itemsets
|