Sciweavers

CPM
2008
Springer

A Black Box for Online Approximate Pattern Matching

13 years 6 months ago
A Black Box for Online Approximate Pattern Matching
Abstract. We present a deterministic black box solution for online approximate matching. Given a pattern of length m and a streaming text of length n that arrives one character at a time, the task is to report the distance between the pattern and a sliding window of the text as soon as the new character arrives. Our solution requires O( log2 m j=1 T(n, 2j-1 )/n) time for each input character, where T(n, m) is the total running time of the best offline algorithm. The types of approximation that are supported include exact matching with wildcards, matching under the Hamming norm, approximating the Hamming norm, k-mismatch and numerical measures such as the L2 and L1 norms. For these examples, the resulting online algorithms take O(log2 m), O( m log m), O(log2 m/ 2 ), O( k log k log m), O(log2 m) and O( m log m) time per character respectively. The space overhead is O(m) which we show is optimal.
Raphaël Clifford, Klim Efremenko, Benny Porat
Added 19 Oct 2010
Updated 19 Oct 2010
Type Conference
Year 2008
Where CPM
Authors Raphaël Clifford, Klim Efremenko, Benny Porat, Ely Porat
Comments (0)