Sciweavers

PVLDB
2008

Sorting hierarchical data in external memory for archiving

13 years 3 months ago
Sorting hierarchical data in external memory for archiving
Sorting hierarchical data in external memory is necessary for a wide variety of applications including archiving scientific data and dealing with large XML datasets. The topic of sorting hierarchical data, however, has received little attention from the research community so far. In this paper we focus on sorting arbitrary hierarchical data that far exceed the size of physical memory. We propose HERMES, an algorithm that generalizes the most widely-used techniques for sorting flat data in external memory. HERMES efficiently exploits the hierarchical structure to minimize the number of disk accesses and optimize the use of available memory. We extract the theoretical bounds of the algorithm with respect to the structure of the hierarchical dataset. We then show how the algorithm can be used to support efficient archiving. We have conducted an experimental study using several workloads and comparing HERMES to the state-of-the-art approaches. Our results show that our algorithm (a) meets...
Ioannis Koltsidas, Heiko Müller, Stratis Vigl
Added 28 Dec 2010
Updated 28 Dec 2010
Type Journal
Year 2008
Where PVLDB
Authors Ioannis Koltsidas, Heiko Müller, Stratis Viglas
Comments (0)