Sciweavers

1768 search results - page 226 / 354
» Mining Very Large Databases
Sort
View
136
Voted
VLDB
2004
ACM
120views Database» more  VLDB 2004»
15 years 9 months ago
Relational link-based ranking
Link analysis methods show that the interconnections between web pages have lots of valuable information. The link analysis methods are, however, inherently oriented towards analy...
Floris Geerts, Heikki Mannila, Evimaria Terzi
135
Voted
SDM
2009
SIAM
170views Data Mining» more  SDM 2009»
16 years 29 days ago
Optimal Distance Bounds on Time-Series Data.
Most data mining operations include an integral search component at their core. For example, the performance of similarity search or classification based on Nearest Neighbors is ...
Michail Vlachos, Philip S. Yu, Suleyman S. Kozat
164
Voted
SIGMOD
2000
ACM
165views Database» more  SIGMOD 2000»
15 years 8 months ago
Finding Generalized Projected Clusters In High Dimensional Spaces
High dimensional data has always been a challenge for clustering algorithms because of the inherent sparsity of the points. Recent research results indicate that in high dimension...
Charu C. Aggarwal, Philip S. Yu
153
Voted
KDD
2008
ACM
217views Data Mining» more  KDD 2008»
16 years 4 months ago
Stream prediction using a generative model based on frequent episodes in event sequences
This paper presents a new algorithm for sequence prediction over long categorical event streams. The input to the algorithm is a set of target event types whose occurrences we wis...
Srivatsan Laxman, Vikram Tankasali, Ryen W. White
131
Voted
KAIS
2008
114views more  KAIS 2008»
15 years 3 months ago
A new concise representation of frequent itemsets using generators and a positive border
A complete set of frequent itemsets can get undesirably large due to redundancy when the minimum support threshold is low or when the database is dense. Several concise representat...
Guimei Liu, Jinyan Li, Limsoon Wong