Sciweavers

IJCNLP
2005
Springer
13 years 10 months ago
Aligning Needles in a Haystack: Paraphrase Acquisition Across the Web
This paper presents a lightweight method for unsupervised extraction of paraphrases from arbitrary textual Web documents. The method differs from previous approaches to paraphrase...
Marius Pasca, Péter Dienes
IJCNLP
2005
Springer
13 years 10 months ago
Web-Based Unsupervised Learning for Query Formulation in Question Answering
Yi-Chia Wang, Jian-Cheng Wu, Tyne Liang, Jason S. ...
IJCNLP
2005
Springer
13 years 10 months ago
Assigning Polarity Scores to Reviews Using Machine Learning Techniques
We propose a novel type of document classification task that quantifies how much a given document (review) appreciates the target object using not binary polarity (good or bad) b...
Daisuke Okanohara, Jun-ichi Tsujii
IJCNLP
2005
Springer
13 years 10 months ago
Automatic Image Annotation Using Maximum Entropy Model
Automatic image annotation is a newly developed and promising technique to provide semantic image retrieval via text descriptions. It concerns a process of automatically labeling t...
Wei Li, Maosong Sun
IJCNLP
2005
Springer
13 years 10 months ago
Web-Based Terminology Translation Mining
Mining terminology translation from a large amount of Web data can be applied in many fields such as reading/writing assistant, machine translation and cross-language information r...
Gaolin Fang, Hao Yu, Fumihito Nishino
IJCNLP
2005
Springer
13 years 10 months ago
Classifying Chinese Texts in Two Steps
Abstract. This paper proposes a two-step method for Chinese text categorization (TC). In the first step, a Naïve Bayesian classifier is used to fix the fuzzy area between two cate...
Xinghua Fan, Maosong Sun, Key-Sun Choi, Qin Zhang
IJCNLP
2005
Springer
13 years 10 months ago
A Method of Recognizing Entity and Relation
The entity and relation recognition, i.e. (1) assigning semantic classes to entities in a sentence, and (2) determining the relations held between entities, is an important task in...
Xinghua Fan, Maosong Sun
IJCNLP
2005
Springer
13 years 10 months ago
Automatic Acquisition of Basic Katakana Lexicon from a Given Corpus
Abstract. Katakana, Japanese phonogram mainly used for loan words, is a troublemaker in Japanese word segmentation. Since Katakana words are heavily domaindependent and there are m...
Toshiaki Nakazawa, Daisuke Kawahara, Sadao Kurohas...
IJCNLP
2005
Springer
13 years 10 months ago
A Case-Based Reasoning Approach for Speech Corpus Generation
Corpus-based stochastic language models have achieved significant success in speech recognition, but construction of a corpus pertaining to a specific application is a difficult ta...
Yandong Fan, Elizabeth A. Kendall