Sciweavers

627 search results - page 67 / 126
» Privacy-Preserving k-NN for Small and Large Data Sets
Sort
View
APPROX
2008
Springer
101views Algorithms» more  APPROX 2008»
15 years 1 months ago
Streaming Algorithms for k-Center Clustering with Outliers and with Anonymity
Clustering is a common problem in the analysis of large data sets. Streaming algorithms, which make a single pass over the data set using small working memory and produce a cluster...
Richard Matthew McCutchen, Samir Khuller
NAACL
2010
14 years 9 months ago
Minimally-Supervised Extraction of Entities from Text Advertisements
Extraction of entities from ad creatives is an important problem that can benefit many computational advertising tasks. Supervised and semi-supervised solutions rely on labeled da...
Sameer Singh, Dustin Hillard, Chris Leggetter
BIBE
2007
IEEE
136views Bioinformatics» more  BIBE 2007»
15 years 1 months ago
A Two-Stage Gene Selection Algorithm by Combining ReliefF and mRMR
Abstract—Gene expression data usually contains a large number of genes, but a small number of samples. Feature selection for gene expression data aims at finding a set of genes ...
Yi Zhang, Chris H. Q. Ding, Tao Li
ICDM
2006
IEEE
76views Data Mining» more  ICDM 2006»
15 years 5 months ago
A Probabilistic Ensemble Pruning Algorithm
An ensemble is a group of learners that work together as a committee to solve a problem. However, the existing ensemble training algorithms sometimes generate unnecessary large en...
Huanhuan Chen, Peter Tiño, Xin Yao
PVLDB
2008
205views more  PVLDB 2008»
14 years 11 months ago
Making SENSE: socially enhanced search and exploration
Online communities like Flickr, del.icio.us and YouTube have established themselves as very popular and powerful services for publishing and searching contents, but also for ident...
Tom Crecelius, Mouna Kacimi, Sebastian Michel, Tho...