Sciweavers

COLING
2002
Structure Alignment Using Bilingual Chunking
A new statistical method called "bilingual chunking" for structure alignment is proposed. Unlike existing approaches, which align hierarchical structures like...
Wei Wang, Ming Zhou, Jin-Xia Huang, Changning Huan...
COLING
2002
A Chart-Parsing Algorithm for Efficient Semantic Analysis
In some contexts, well-formed natural language cannot be expected as input to information or communication systems. In these contexts, the use of grammar-independent input (sequen...
Pascal Vaillant
COLING
2002
Combining Unsupervised and Supervised Methods for PP Attachment Disambiguation
Statistical methods for PP attachment fall into two classes according to the training material used: first, unsupervised methods trained on raw text corpora and second, supervised...
Martin Volk
COLING
2002
Text Generation from Keywords
We describe a method for generating sentences from "keywords" or "headwords". This method consists of two main parts, candidate-text construction and evaluatio...
Kiyotaka Uchimoto, Satoshi Sekine, Hitoshi Isahara
COLING
2002
Morphological Analysis of the Spontaneous Speech Corpus
This paper describes a project tagging a spontaneous speech corpus with morphological information such as word segmentation and parts-of-speech. We use a morphological analysis sys...
Kiyotaka Uchimoto, Chikashi Nobata, Atsushi Yamada...
COLING
2002
A Cheap and Fast Way to Build Useful Translation Lexicons
The paper presents a statistical approach to automatically building translation lexicons from parallel corpora. We briefly describe the pre-processing steps, a baseline iterative m...
Dan Tufis
COLING
2002
Applying an NVEF Word-Pair Identifier to the Chinese Syllable-to-Word Conversion Problem
Syllable-to-word (STW) conversion is important in Chinese phonetic input methods and speech recognition. There are two major problems in STW conversion: (1) resolving the ambi...
Jia-Lin Tsai, Wen-Lian Hsu
COLING
2002
Multi-Dimensional Text Classification
This paper proposes a multi-dimensional framework for classifying text documents. In this framework, the concept of a multi-dimensional category model is introduced for representing ...
Thanaruk Theeramunkong, Verayuth Lertnattee
COLING
2002
Shallow Language Processing Architecture for Bulgarian
This paper describes LINGUA - an architecture for text processing in Bulgarian. First, the pre-processing modules for tokenisation, sentence splitting, paragraph segmentation, par...
Hristo Tanev, Ruslan Mitkov