Sciweavers

COLING
2002
13 years 4 months ago
Building a Large-Scale Annotated Chinese Corpus
In this paper we address issues related to building a large-scale Chinese corpus. We try to answer four questions: (i) how to speed up annotation, (ii) how to maintain high annota...
Nianwen Xue, Fu-Dong Chiou, Martha Stone Palmer
COLING
2002
13 years 4 months ago
Structure Alignment Using Bilingual Chunking
A new statistical method called "bilingual chunking" for structure alignment is proposed. Different with the existing approaches which align hierarchical structures like...
Wei Wang, Ming Zhou, Jin-Xia Huang, Changning Huan...
COLING
2002
13 years 4 months ago
A Chart-Parsing Algorithm for Efficient Semantic Analysis
In some contexts, well-formed natural language cannot be expected as input to information or communication systems. In these contexts, the use of grammar-independent input (sequen...
Pascal Vaillant
COLING
2002
13 years 4 months ago
Combining Unsupervised and Supervised Methods for PP Attachment Disambiguation
Statistical methods for PP attachment fall into two classes according to the training material used: first, unsupervised methods trained on raw text corpora and second, supervised...
Martin Volk
COLING
2002
13 years 4 months ago
Text Generation from Keywords
We describe a method for generating sentences from "keywords" or "headwords". This method consists of two main parts, candidate-text construction and evaluatio...
Kiyotaka Uchimoto, Satoshi Sekine, Hitoshi Isahara
COLING
2002
13 years 4 months ago
Morphological Analysis of the Spontaneous Speech Corpus
This paper describes a project tagging a spontaneous speech corpus with morphological information such as word segmentation and parts-ofspeech. We use a morphological analysis sys...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
COLING
2002
13 years 4 months ago
A Cheap and Fast Way to Build Useful Translation Lexicons
The paper presents a statistical approach to automatic building of translation lexicons from parallel corpora. We briefly describe the pre-processing steps, a baseline iterative m...
Dan Tufis
COLING
2002
13 years 4 months ago
Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Syllable-to-word (STW) conversion is important in Chinese phonetic input methods and speech recognition. There are two major problems in the STW conversion: (1) resolving the ambi...
Jia-Lin Tsai, Wen-Lian Hsu
COLING
2002
13 years 4 months ago
Multi-Dimensional Text Classification
This paper proposes a multi-dimensional framework for classifying text documents. In this framework, the concept of multidimensional category model is introduced for representing ...
Thanaruk Theeramunkong, Verayuth Lertnattee