CCGRID
2009
IEEE

File Clustering Based Replication Algorithm in a Grid Environment

Replication in grid file systems can significantly improve the I/O performance of data-intensive applications. However, most existing replication techniques operate on individual files, which may introduce inefficient replication overhead when the number of files is large. We propose a file clustering based replication algorithm for grid file systems. Our algorithm groups files according to how often they are accessed simultaneously and stores replicas of the clustered files on storage nodes so as to minimize the expected future read access times to the clustered files and the replication times for individual files, under the given storage capacity limit. Our experiments on a grid environment of 20 nodes across 5 sites suggest that the proposed algorithm achieves accurate file clustering and efficient replica management; our clustering policy with the file cluster size limit of 5120 MB and storage capacity limit
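The idea of grouping files by simultaneous access can be illustrated with a small sketch. This is not the authors' algorithm, which the abstract does not specify in detail; it is a simple greedy heuristic under stated assumptions: accesses are given as hypothetical "windows" (lists of files read together), file sizes are known in MB, and the most frequently co-accessed pairs are merged first as long as the merged cluster stays within a size limit. The names `cluster_files`, `sessions`, `sizes_mb`, and `max_cluster_mb` are all illustrative.

```python
from collections import Counter
from itertools import combinations


def cluster_files(sessions, sizes_mb, max_cluster_mb=5120):
    """Greedy co-access clustering (illustrative sketch, not the paper's method).

    sessions: iterable of file-name lists, each a set of files accessed together.
    sizes_mb: dict mapping every file name to its size in MB
              (every file in `sessions` is assumed to appear here).
    """
    # Count how often each pair of files appears in the same access window.
    co_access = Counter()
    for window in sessions:
        for a, b in combinations(sorted(set(window)), 2):
            co_access[(a, b)] += 1

    # Union-find: every file starts as its own cluster.
    parent = {f: f for f in sizes_mb}
    total = dict(sizes_mb)  # current cluster size, keyed by cluster root

    def find(f):
        while parent[f] != f:
            parent[f] = parent[parent[f]]  # path halving
            f = parent[f]
        return f

    # Merge the most frequently co-accessed pairs first, respecting the size cap.
    ranked = sorted(co_access.items(), key=lambda kv: (-kv[1], kv[0]))
    for (a, b), _count in ranked:
        ra, rb = find(a), find(b)
        if ra != rb and total[ra] + total[rb] <= max_cluster_mb:
            parent[rb] = ra
            total[ra] += total.pop(rb)

    # Collect files by cluster root and return a deterministic ordering.
    clusters = {}
    for f in sizes_mb:
        clusters.setdefault(find(f), set()).add(f)
    return sorted(sorted(c) for c in clusters.values())


if __name__ == "__main__":
    sessions = [["a", "b"], ["a", "b"], ["b", "c"], ["d"]]
    sizes_mb = {"a": 3000, "b": 2000, "c": 1000, "d": 10}
    print(cluster_files(sessions, sizes_mb))
```

With the 5120 MB limit, files `a` and `b` are clustered (5000 MB), while `c` stays separate because adding it would exceed the limit. A replication policy would then place whole clusters on storage nodes rather than individual files; that placement step is outside this sketch.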
Hitoshi Sato, Satoshi Matsuoka, Toshio Endo
Added 10 Jul 2010
Updated 10 Jul 2010
Type Conference
Year 2009
Where CCGRID