Sciweavers

ALENEX
2009

Theory and Practise of Monotone Minimal Perfect Hashing

13 years 5 months ago
Theory and Practise of Monotone Minimal Perfect Hashing
Minimal perfect hash functions have been shown to be useful to compress data in several data management tasks. In particular, order-preserving minimal perfect hash functions [10] have been used to retrieve the position of a key in a given list of keys: however, the ability to preserve any given order leads to an unavoidable (n log n) lower bound on the number of bits required to store the function. Recently, it was observed [1] that very frequently the keys to be hashed are sorted in their intrinsic (i.e., lexicographical) order. This is typically the case of dictionaries of search engines, list of URLs of web graphs, etc. We refer to this restricted version of the problem as monotone minimal perfect hashing. We analyse experimentally the data structures proposed in [1], and along our way we propose some new methods that, albeit asymptotically equivalent or worse, perform very well in practise, and provide a balance between access speed, ease of construction, and space usage.
Djamal Belazzougui, Paolo Boldi, Rasmus Pagh, Seba
Added 08 Nov 2010
Updated 08 Nov 2010
Type Conference
Year 2009
Where ALENEX
Authors Djamal Belazzougui, Paolo Boldi, Rasmus Pagh, Sebastiano Vigna
Comments (0)