
CLUSTER
2004
IEEE

Hierarchical Bloom filter arrays (HBA): a novel, scalable metadata management system for large cluster-based storage

An efficient, distributed scheme for file mapping (file lookup) is critical in decentralizing metadata management within a group of metadata servers. This paper presents a novel technique called HBA (Hierarchical Bloom Filter Arrays) to map file names to the servers holding their metadata. Two levels of probabilistic arrays, i.e., Bloom Filter Arrays, with different accuracies are used on each metadata server. One array, with lower accuracy and representing the distribution of the entire metadata, trades accuracy for significantly reduced memory overhead, while the other array, with higher accuracy, caches partial distribution information and exploits the temporal locality of file access patterns. Extensive trace-driven simulations have shown our HBA design to be highly effective and efficient in improving performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or super-clusters).
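The two-level lookup described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class names (`BloomFilter`, `BloomFilterArray`, `HBALookup`), the filter sizes, the hash construction, the "exactly one filter hits" rule, and the fallback result are all assumptions made for the example. The sketch also omits any replacement policy for the high-accuracy array.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter using double hashing over a SHA-256 digest."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        digest = hashlib.sha256(key.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force a non-zero stride
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, key):
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


class BloomFilterArray:
    """One Bloom filter per metadata server."""

    def __init__(self, server_ids, num_bits, num_hashes):
        self.filters = {sid: BloomFilter(num_bits, num_hashes)
                        for sid in server_ids}

    def record(self, filename, server_id):
        self.filters[server_id].add(filename)

    def hits(self, filename):
        # Every server whose filter claims to hold this file name.
        return [sid for sid, bf in self.filters.items() if filename in bf]


class HBALookup:
    """Hypothetical two-level lookup kept on each metadata server."""

    def __init__(self, server_ids):
        # Assumed sizes: more bits per filter gives the higher-accuracy level.
        self.hot = BloomFilterArray(server_ids, num_bits=1 << 16, num_hashes=8)
        self.global_ = BloomFilterArray(server_ids, num_bits=1 << 12, num_hashes=3)

    def register(self, filename, server_id):
        # Low-accuracy array: covers the entire metadata distribution.
        self.global_.record(filename, server_id)

    def note_access(self, filename, server_id):
        # High-accuracy array: caches recently accessed files only.
        self.hot.record(filename, server_id)

    def locate(self, filename):
        # Try the high-accuracy level first, then the low-accuracy one.
        for level, array in (("hot", self.hot), ("global", self.global_)):
            candidates = array.hits(filename)
            if len(candidates) == 1:  # assumed rule: accept only a unique hit
                return candidates[0], level
        return None, "fallback"  # caller would need another way to find the file
```

In this sketch, `locate` returns a server identifier together with the level that answered. A result of `(None, "fallback")` means neither array produced a unique hit, so the caller would have to resolve the name some other way, for example by querying all metadata servers.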
Yifeng Zhu, Hong Jiang, Jun Wang
Added 20 Aug 2010
Updated 20 Aug 2010
Type Conference
Year 2004
Where CLUSTER