Sciweavers

COLING
2010
13 years 5 days ago
Visually and Phonologically Similar Characters in Incorrect Simplified Chinese Words
Visually and phonologically similar characters are major contributing factors for errors in Chinese text. By defining appropriate similarity measures that consider extended Cangji...
Chao-Lin Liu, Min-Hua Lai, Yi-Hsuan Chuang, Chia-Y...
COLING
2010
13 years 5 days ago
Semi-supervised Semantic Pattern Discovery with Guidance from Unsupervised Pattern Clusters
We present a simple algorithm for clustering semantic patterns based on distributional similarity and use cluster memberships to guide semi-supervised pattern discovery. We apply ...
Ang Sun, Ralph Grishman
COLING
2010
13 years 5 days ago
Topic-Based Bengali Opinion Summarization
In this paper the development of an opinion summarization system that works on Bengali News corpus has been described. The system identifies the sentiment information in each docu...
Amitava Das, Sivaji Bandyopadhyay
COLING
2010
13 years 5 days ago
Evaluating FrameNet-style semantic parsing: the role of coverage gaps in FrameNet
Supervised semantic role labeling (SRL) systems are generally claimed to have accuracies in the range of 80% and higher (Erk and Padó...
Alexis Palmer, Caroline Sporleder
COLING
2010
13 years 5 days ago
Generative Alignment and Semantic Parsing for Learning from Ambiguous Supervision
We present a probabilistic generative model for learning semantic parsers from ambiguous supervision. Our approach learns from natural language sentences paired with world states ...
Joohyun Kim, Raymond J. Mooney
COLING
2010
13 years 5 days ago
Unsupervised Discriminative Language Model Training for Machine Translation using Simulated Confusion Sets
An unsupervised discriminative training procedure is proposed for estimating a language model (LM) for machine translation (MT). An English-to-English synchronous context-free gra...
Zhifei Li, Ziyuan Wang, Sanjeev Khudanpur, Jason E...
COLING
2010
13 years 5 days ago
A framework for representing lexical resources
Our goal is to propose a description model for the lexicon. We describe a software framework for representing the lexicon and its variations called Proteus. Various examples show ...
Fabrice Issac
COLING
2010
13 years 5 days ago
Extraction of Multi-word Expressions from Small Parallel Corpora
We present a general methodology for extracting multi-word expressions (of various types), along with their translations, from small parallel corpora. We automatically align the p...
Yulia Tsvetkov, Shuly Wintner
COLING
2010
13 years 5 days ago
Towards the Adequate Evaluation of Morphosyntactic Taggers
There exists a well-established and almost unanimously adopted measure of tagger performance, namely, accuracy. Although it is perfectly adequate for small tagsets and typical app...
Szymon Acedański, Adam Przepiórkowski