Sciweavers

JEA
2008

Better external memory suffix array construction

Suffix arrays are a simple and powerful data structure for text processing that can be used for full-text indexes, data compression, and many other applications, in particular in bioinformatics. However, so far it has looked prohibitive to build suffix arrays for huge inputs that do not fit into main memory. This paper presents the design, analysis, implementation, and experimental evaluation of several new and improved algorithms for suffix array construction. The algorithms are asymptotically optimal in the worst case or on average. Our implementation can construct suffix arrays for inputs of up to 4 GBytes in hours on a low-cost machine. As a tool of possible independent interest, we present a systematic way to design, analyze, and implement pipelined algorithms.
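For readers unfamiliar with the data structure, the following is a minimal in-memory sketch of what a suffix array is and how it can serve as a full-text index. It is not one of the paper's external-memory algorithms: it builds the array by naively sorting suffixes, which is only practical for small inputs that fit in main memory. The function names `build_suffix_array` and `find_occurrences` are hypothetical and chosen for illustration only.

```python
from typing import List


def build_suffix_array(text: str) -> List[int]:
    """Return the start positions of all suffixes of `text` in lexicographic order.

    Naive illustration: sorts by comparing whole suffixes, so it is far slower
    than the construction algorithms the paper describes.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def find_occurrences(text: str, sa: List[int], pattern: str) -> List[int]:
    """Return all start positions of `pattern` in `text`, using the suffix array `sa`.

    Suffixes that begin with `pattern` form one contiguous range in `sa`,
    so two binary searches locate the range boundaries.
    """
    m = len(pattern)

    # First suffix whose length-m prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo

    # First suffix whose length-m prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid

    return sorted(sa[start:lo])


if __name__ == "__main__":
    text = "banana"
    sa = build_suffix_array(text)
    print(sa)
    print(find_occurrences(text, sa, "ana"))
```

Once the array exists, each query costs a logarithmic number of string comparisons, which is why the expensive step for large texts is construction, the problem the paper addresses.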
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2008
Where JEA
Authors Roman Dementiev, Juha Kärkkäinen, Jens Mehnert, Peter Sanders