Sciweavers

FAST
2010

Efficient Object Storage Journaling in a Distributed Parallel File System

13 years 6 months ago
Efficient Object Storage Journaling in a Distributed Parallel File System
Journaling is a widely used technique to increase file system robustness against metadata and/or data corruptions. While the overhead of journaling can be masked by the page cache for small-scale, local file systems, we found that Lustre's use of journaling for the object store significantly impacted the overall performance of our large-scale centerwide parallel file system. By requiring that each write request wait for a journal transaction to commit, Lustre introduced serialization to the client request stream and imposed additional latency due to disk head movement (seeks) for each request. In this paper, we present the challenges we faced while deploying a very large scale production storage system. Our work provides a head-to-head comparison of two significantly different approaches to increasing the overall efficiency of the Lustre file system. First, we present a hardware solution using external journaling devices to eliminate the latencies incurred by the extra disk head ...
Sarp Oral, Feiyi Wang, David Dillow, Galen M. Ship
Added 02 Oct 2010
Updated 02 Oct 2010
Type Conference
Year 2010
Where FAST
Authors Sarp Oral, Feiyi Wang, David Dillow, Galen M. Shipman, Ross Miller, Oleg Drokin
Comments (0)