Sciweavers

6 search results - page 1 / 2
» b-Bit minwise hashing
Sort
View
WWW
2010
ACM
13 years 11 months ago
b-Bit minwise hashing
This paper establishes the theoretical framework of b-bit minwise hashing. The original minwise hashing method has become a standard technique for estimating set similarity (e.g.,...
Ping Li, Arnd Christian König
ICDE
2010
IEEE
494views Database» more  ICDE 2010»
14 years 4 months ago
Hashing Tree-Structured Data: Methods and Applications
In this article we propose a new hashing framework for tree-structured data. Our method maps an unordered tree into a multiset of simple wedge-shaped structures refered to as pivot...
Shirish Tatikonda, Srinivasan Parthasarathy
ICALP
2010
Springer
13 years 9 months ago
On the k-Independence Required by Linear Probing and Minwise Independence
We show that linear probing requires 5-independent hash functions for expected constant-time performance, matching an upper bound of [Pagh et al. STOC’07]. For (1 + ε)-approxima...
Mihai Patrascu, Mikkel Thorup
STOC
2002
ACM
177views Algorithms» more  STOC 2002»
14 years 4 months ago
Similarity estimation techniques from rounding algorithms
A locality sensitive hashing scheme is a distribution on a family F of hash functions operating on a collection of objects, such that for two objects x, y, PrhF [h(x) = h(y)] = si...
Moses Charikar
SPIRE
2010
Springer
13 years 2 months ago
Fingerprinting Ratings for Collaborative Filtering - Theoretical and Empirical Analysis
Abstract. We consider fingerprinting methods for collaborative filtering (CF) systems. In general, CF systems show their real strength when supplied with enormous data sets. Earl...
Yoram Bachrach, Ralf Herbrich