CLUSTER
2003
IEEE

A Case Study of Parallel I/O for Biological Sequence Search on Linux Clusters

In this paper we analyze the I/O access patterns of a widely used biological sequence search tool and implement two variations that use parallel I/O for data access, based on PVFS (Parallel Virtual File System) and CEFT-PVFS (Cost-Effective Fault-Tolerant PVFS), respectively. Experiments show that both variations outperform the original tool even when they use the same number of storage devices, or fewer. Although the performance of both variations improves consistently as the first few servers are added, the gain from parallel I/O becomes insignificant as the number of servers increases further. We also examine the effectiveness of two read-performance optimization techniques in CEFT-PVFS, using this tool as a benchmark. The results indicate that (1) doubling the degree of parallelism boosts read performance to approach that of PVFS; (2) skipping hotspots can substantially improve I/O performance when the load on data servers is highly i...
Yifeng Zhu, Hong Jiang, Xiao Qin, David R. Swanson
Added 04 Jul 2010
Updated 04 Jul 2010
Type Conference
Year 2003
Where CLUSTER