Sciweavers

637 search results - page 98 / 128
» Generating Synthetic Data to Match Data Mining Patterns
Sort
View
CIKM
2009
Springer
15 years 8 months ago
Space-economical partial gram indices for exact substring matching
Exact substring matching queries on large data collections can be answered using q-gram indices, that store for each occurring q-byte pattern an (ordered) posting list with the po...
Nan Tang, Lefteris Sidirourgos, Peter A. Boncz
217
Voted
SIGMOD
2007
ACM
195views Database» more  SIGMOD 2007»
16 years 4 months ago
Effective variation management for pseudo periodical streams
Many database applications require the analysis and processing of data streams. In such systems, huge amounts of data arrive rapidly and their values change over time. The variati...
Lv-an Tang, Bin Cui, Hongyan Li, Gaoshan Miao, Don...
PVLDB
2010
110views more  PVLDB 2010»
15 years 2 months ago
Behavior Based Record Linkage
In this paper, we present a new record linkage approach that uses entity behavior to decide if potentially different entities are in fact the same. An entity’s behavior is extra...
Mohamed Yakout, Ahmed K. Elmagarmid, Hazem Elmelee...
GIS
2008
ACM
16 years 4 months ago
Density based co-location pattern discovery
Co-location pattern discovery is to find classes of spatial objects that are frequently located together. For example, if two categories of businesses often locate together, they ...
Xiangye Xiao, Xing Xie, Qiong Luo, Wei-Ying Ma
DASFAA
2006
IEEE
168views Database» more  DASFAA 2006»
15 years 7 months ago
COWES: Clustering Web Users Based on Historical Web Sessions
Clustering web users is one of the most important research topics in web usage mining. Existing approaches cluster web users based on the snapshots of web user sessions. They do no...
Ling Chen 0002, Sourav S. Bhowmick, Jinyan Li