Sciweavers

COLING
2010

Chart Pruning for Fast Lexicalised-Grammar Parsing

Given the increasing need to process massive amounts of textual data, efficiency of NLP tools is becoming a pressing concern. Parsers based on lexicalised grammar formalisms, such as TAG and CCG, can be made more efficient using supertagging, which for CCG is so effective that every derivation consistent with the supertagger output can be stored in a packed chart. However, wide-coverage CCG parsers still produce a very large number of derivations for typical newspaper or Wikipedia sentences. In this paper we investigate two forms of chart pruning, and develop a novel method for pruning complete cells in a parse chart. The result is a wide-coverage CCG parser that can process almost 100 sentences per second, with little or no loss in accuracy over the baseline with no pruning.
Added 13 May 2011
Updated 13 May 2011
Type Conference
Year 2010
Where COLING
Authors Yue Zhang 0004, Byung-Gyu Ahn, Stephen Clark, Curt Van Wyk, James R. Curran, Laura Rimell