Sciweavers

SCCC
2002
IEEE

Improved Antidictionary Based Compression

13 years 9 months ago
Improved Antidictionary Based Compression
The compression of binary texts using antidictionaries is a novel technique based on the fact that some substrings (called “antifactors”) never appear in the text. Let × be an antifactor, where is its last bit. Every time × appears in the text we know that the next bit is and hence omit its representation. Since building the set of all antifactors is space consuming at compression time, it is customary to limit the maximum length of antifactors considered up to a constant . Larger yields better compression of the text but requires more space at compression time. In this paper we introduce the notion of almost antifactors, which are strings that rarely appear in the text. More formally, almost antifactors are strings that, if we consider them as antifactors and separately code their occurrences as exceptions, the compression ratio improves. We show that almost antifactors permit improving compression with a limited amount of main memory to compress. Our experiments show that they...
Maxime Crochemore, Gonzalo Navarro
Added 15 Jul 2010
Updated 15 Jul 2010
Type Conference
Year 2002
Where SCCC
Authors Maxime Crochemore, Gonzalo Navarro
Comments (0)