Sciweavers

ACL
2003
13 years 5 months ago
tRuEcasIng
Truecasing is the process of restoring case information to badly-cased or noncased text. This paper explores truecasing issues and proposes a statistical, language modeling based ...
Lucian Vlad Lita, Abraham Ittycheriah, Salim Rouko...
ACL
2003
13 years 5 months ago
Learning to Predict Pitch Accents and Prosodic Boundaries in Dutch
We train a decision tree inducer (CART) and a memory-based classifier (MBL) on predicting prosodic pitch accents and breaks in Dutch text, on the basis of shallow, easy-to-comput...
Erwin Marsi, Martin Reynaert, Antal van den Bosch,...
ACL
2003
13 years 5 months ago
Clustering Polysemic Subcategorization Frame Distributions Semantically
Previous research has demonstrated the utility of clustering in inducing semantic verb classes from undisambiguated corpus data. We describe a new approach which involves clusteri...
Anna Korhonen, Yuval Krymolowski, Zvika Marx
ACL
2003
13 years 5 months ago
Flexible Guidance Generation Using User Model in Spoken Dialogue Systems
We address appropriate user modeling in order to generate cooperative responses to each user in spoken dialogue systems. Unlike previous studies that focus on user’s knowledge o...
Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara...
ACL
2003
13 years 5 months ago
Feature-Rich Statistical Translation of Noun Phrases
We define noun phrase translation as a subtask of machine translation. This enables us to build a dedicated noun phrase translation subsystem that improves over the currently bes...
Philipp Koehn, Kevin Knight
ACL
2003
13 years 5 months ago
An Expert Lexicon Approach to Identifying English Phrasal Verbs
Phrasal Verbs are an important feature of the English language. Properly identifying them provides the basis for an English parser to decode the related structures. Phrasal verbs ...
Wei Li 0003, Xiuhong Zhang, Cheng Niu, Yuankai Jia...
ACL
2003
13 years 5 months ago
A Syllable Based Word Recognition Model for Korean Noun Extraction
Noun extraction is very important for many NLP applications such as information retrieval, automatic text classification, and information extraction. Most of the previous Korean ...
Do-Gil Lee, Hae-Chang Rim, Heui-Seok Lim
ACL
2003
13 years 5 months ago
Language Model Based Arabic Word Segmentation
Young-Suk Lee, Kishore Papineni, Salim Roukos, Oss...
ACL
2003
13 years 5 months ago
Accurate Unlexicalized Parsing
We demonstrate that an unlexicalized PCFG can parse much more accurately than previously shown, by making use of simple, linguistically motivated state splits, which break down fa...
Dan Klein, Christopher D. Manning