Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

24

COLING
2002

favoriteEmaildiscussreport

97views Computational Linguistics» more COLING 2002»

Detecting Errors in Corpora Using Support Vector Machines

13 years 9 months ago

Detecting Errors in Corpora Using Support Vector Machines

Download acl.ldc.upenn.edu

While the corpus-based research relies on human annotated corpora, it is often said that a non-negligible amount of errors remain even in frequently used corpora such as Penn Treebank. Detection of errors in annotated corpora is important for corpus-based natural language processing. In this paper, we propose a method to detect errors in corpora using support vector machines (SVMs). This method is based on the idea of extracting exceptional elements that violate consistency. We propose a method of using SVMs to assign a weight to each element and to find errors in a POS tagged corpus. We apply the method to English and Japanese POS-tagged corpora and achieve high precision in detecting errors.

Tetsuji Nakagawa, Yuji Matsumoto

Real-time Traffic

COLING 2002 | COLING 2008 | Corpus-based Natural Language | Corpus-based Research | Human Annotated Corpora |

claim paper

Related Content

» Extracting Word Sequence Correspondences with Support Vector Machines

» Relation Extraction Using Support Vector Machine

» A Support Vector Machine Approach for Detection of Microcalcifications

» Phonetic Speaker Recognition with Support Vector Machines

» Object Detection in Images RunTime Complexity and Parameter Selection of Support Vector Ma...

» Support vector machine classifiers for sequential decision problems

» Incorporating Conditional Independence Assumption with Support Vector Machines to Enhance ...

» Estimating the Confidence Interval for Prediction Errors of Support Vector Machine Classif...

» Nearly Uniform Validation Improves CompressionBased Error Bounds

Post Info
More Details (n/a)

Added	17 Dec 2010
Updated	17 Dec 2010
Type	Journal
Year	2002
Where	COLING
Authors	Tetsuji Nakagawa, Yuji Matsumoto

Comments (0)