Sciweavers

NLPRS
2001
Springer
13 years 9 months ago
A Separate-and-Learn Approach to EM Learning of PCFGs
WeproposeanewapproachtoEMlearning of PCFGs. We completely separate the process of EM learning from that of parsing, andfor theformer, weintroduce a new EM algorithm called the gra...
Taisuke Sato, Shigeru Abe, Yoshitaka Kameya, Kiyoa...
NLPRS
2001
Springer
13 years 9 months ago
Topic Segmentation : A First Stage to Dialog-Based Information Extraction
We study the problem of topic segmentation of manually transcribed speech in order to facilitate information extraction from dialogs. Our approach is based on a combination of mul...
Narjès Boufaden, Guy Lapalme, Yoshua Bengio
NLPRS
2001
Springer
13 years 9 months ago
Long Sentence Partitioning using Structure Analysis for Machine Translation
in machine translation, long sentences are usually assumed to be difficult to treat. The main reason is the syntactic ambiguity which increases explosively as a sentence become lo...
Yoon-Hyung Roh, Young Ae Seo, Ki-Young Lee, Sung-K...
NLPRS
2001
Springer
13 years 9 months ago
A Hierarchical EM Approach to Word Segmentation
We propose a simple two-level hierarchical probability model for unsupervised word segmentation. By treating words as strings composed of morphemes/phonemes which are themselves c...
Fuchun Peng, Dale Schuurmans
NLPRS
2001
Springer
13 years 9 months ago
Korean Text Generation from Database for Homeshopping Sites
This paper describes a text generation system, XExplainer, which can dynamically produce a description of commodities in Korean from a relational database for homeshopping sites. ...
Ji-Eun Roh, Sin-Jae Kang, Jong-Hyeok Lee
NLPRS
2001
Springer
13 years 9 months ago
Linguistic Techniques to Improve the Performance of Automatic Text Categorization
This paper presents a method for incorporating natural language processing into existing text categorization procedures. Three aspects are considered in the investigation: (i) a m...
Akiko N. Aizawa
NLPRS
2001
Springer
13 years 9 months ago
A Simple Closed-Class/Open-Class Factorization for Improved Language Modeling
We describe a simple improvement to ngram language models where we estimate the distribution over closed-class (function) words separately from the conditional distribution of ope...
Fuchun Peng, Dale Schuurmans