2002 IEEE International Conference on Data Mining, 2002. Proceedings.
Download PDF

Abstract

Maximal frequent itemsets (MFI) are crucial to many tasks in data mining. Since the MaxMiner algorithm first introduced enumeration trees for mining MFI in 1998, several methods have been proposed to use depth first search to improve performance. To further improve the performance of mining MFI, we proposed a technique that takes advantage of the information gathered from previous steps to discover new MFI. More specifically, our algorithm called SmartMiner gathers and passes tail information and uses a heuristic select function which uses the tail information to select the next node to explore. Compared with Mafia and GenMax, SmartMiner generates a smaller search tree, requires a smaller number of support counting, and does not require superset checking. Using the datasets Mushroom and Connect, our experimental study reveals that SmartMiner generates the same MFI as Mafia and GenMax, but yields an order of magnitude improvement in speed.
Like what you’re reading?
Already a member?
Get this article FREE with a new membership!

Related Articles