Sciweavers

1061 search results - page 140 / 213
» Massive Data Pre-Processing with a Cluster Based Approach
ACL
2011
14 years 7 months ago
Can Document Selection Help Semi-supervised Learning? A Case Study On Event Extraction
Annotating training data for event extraction is tedious and labor-intensive. Most current event extraction tasks rely on hundreds of annotated documents, but this is often not en...
Shasha Liao, Ralph Grishman
ADC
2007
Springer
15 years 10 months ago
The Privacy of k-NN Retrieval for Horizontal Partitioned Data -- New Methods and Applications
Recently, privacy issues have become important in clustering analysis, especially when data is horizontally partitioned over several parties. Associative queries are the core retr...
Artak Amirbekyan, Vladimir Estivill-Castro
ICDE
2010
IEEE
16 years 3 months ago
WikiAnalytics: Ad-hoc Querying of Highly Heterogeneous Structured Data
Searching and extracting meaningful information out of highly heterogeneous datasets is a hot topic that has received a lot of attention. However, the existing solutions are based on e...
Andrey Balmin, Emiran Curtmola
IVC
2007
15 years 4 months ago
Vector quantization and fuzzy ranks for image reconstruction
The problem of clustering is often addressed with techniques based on a Voronoi partition of the data space. Vector quantization is based on a similar principle, but it is a diffe...
Stefano Rovetta, Francesco Masulli
CCS
2009
ACM
16 years 4 months ago
The union-split algorithm and cluster-based anonymization of social networks
Knowledge discovery on social network data can uncover latent social trends and produce valuable findings that benefit the welfare of the general public. A growing amount of resea...
Brian Thompson, Danfeng Yao