Sciweavers

USENIX
2007

Implementation and Performance Evaluation of Fuzzy File Block Matching

13 years 6 months ago
Implementation and Performance Evaluation of Fuzzy File Block Matching
The fuzzy file block matching technique (fuzzy matching for short), was first proposed for opportunistic use of Content Addressable Storage. Fuzzy matching aims to increase the hit ratio in the content-addressable storage providers, and thus can improve the performance of underlying distributed file storage systems by potentially saving significant network bandwidth and reducing file transmission costs. Fuzzy matching employs shingling to represent the fuzzy hashing of file blocks for similarity detection, and error-correcting information to reconstruct the canonical content of a file block from some similar blocks. In this paper, we present the implementation details of fuzzy matching and a very basic evaluation of its performance. In particular, we show that fuzzy matching can recover new versions of GNU Emacs source from older versions.
Bo Han, Peter J. Keleher
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2007
Where USENIX
Authors Bo Han, Peter J. Keleher
Comments (0)