Sciweavers

AIRS
2006
Springer

Efficient and Robust Phrase Chunking Using Support Vector Machines

13 years 8 months ago
Efficient and Robust Phrase Chunking Using Support Vector Machines
Automatic text chunking is a task which aims to recognize phrase structures in natural language text. It is the key technology of knowledge-based system where phrase structures provide important syntactic information for knowledge representation. Support Vector Machine (SVM-based) phrase chunking system had been shown to achieve high performance for text chunking. But its inefficiency limits the actual use on large dataset that only handles several thousands tokens per second. In this paper, we firstly show that the state-of-the-art performance (94.25) in the CoNLL-2000 shared task based on conventional SVM learning. However, the off-the-shelf SVM classifiers are inefficient when the number of phrase types scales to high. Therefore, we present two novel methods that make the system substantially faster in terms of training and testing while only results in a slightly decrease of system performance. Experimental result shows that our method achieves 94.09 in F rate, which handles 13000 ...
Yu-Chieh Wu, Jie-Chi Yang, Yue-Shi Lee, Show-Jane
Added 20 Aug 2010
Updated 20 Aug 2010
Type Conference
Year 2006
Where AIRS
Authors Yu-Chieh Wu, Jie-Chi Yang, Yue-Shi Lee, Show-Jane Yen
Comments (0)