Efficient Enumeration of Frequent Sequences

12 years 1 months ago
Efficient Enumeration of Frequent Sequences
In this paper we present SPADE, a new algorithm for fast discovery of Sequential Patterns. The existing solutions to this problem make repeated database scans, and use complex hash structures which have poor locality. SPADE utilizes combinatorial properties to decompose the original problem into smaller sub-problems, that can be independently solved in main-memory using efficient lattice search techniques, and using simple join operations. All sequences are discovered in only three database scans. Experiments show that SPADE outperforms the best previous algorithm by a factor of two, and by an order of magnitude with some pre-processed data. It also has linear scalability with respect to the number of customers, and a number of other database parameters.
Mohammed Javeed Zaki
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1998
Where CIKM
Authors Mohammed Javeed Zaki
Comments (0)