Sciweavers

COLING
2010

Incremental Chinese Lexicon Extraction with Minimal Resources on a Domain-Specific Corpus

This article presents an original lexical unit extraction system for Chinese. The method is an incremental process driven by an association score, following a statistically aided linguistic approach that requires minimal resources. We also introduce a linguistics-based definition of the lexical unit and use it to describe an evaluation protocol dedicated to the task. Experimental results on a domain-specific corpus show that the method performs better than other approaches. The extraction results, evaluated on a random sample of the working corpus, show a recall of 68.4% and a precision of 37.1%.
Gaël Patin
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Gaël Patin