Sciweavers

CORR
2002
Springer
84views Education» more  CORR 2002»
13 years 4 months ago
A Method for Open-Vocabulary Speech-Driven Text Retrieval
While recent retrieval techniques do not
Atsushi Fujii, Katunobu Itou, Tetsuya Ishikawa
CORR
2002
Springer
72views Education» more  CORR 2002»
13 years 4 months ago
Using the Annotated Bibliography as a Resource for Indicative Summarization
We report on a language resource consisting of 2000 annotated bibliography entries, which is being analyzed as part of our research on indicative document summarization. We show h...
Min-Yen Kan, Judith L. Klavans, Kathleen McKeown
CORR
2002
Springer
95views Education» more  CORR 2002»
13 years 4 months ago
Unsupervised Learning of Morphology without Morphemes
The first morphological learner based upon the theory of Whole Word Morphology (Ford et al., 1997) is outlined, and preliminary evaluation results are presented. The program, Whol...
Sylvain Neuvel, Sean A. Fulop
CORR
2002
Springer
96views Education» more  CORR 2002»
13 years 4 months ago
Thumbs up? Sentiment Classification using Machine Learning Techniques
We consider the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative. Using movie reviews as data, w...
Bo Pang, Lillian Lee, Shivakumar Vaithyanathan
CORR
2002
Springer
84views Education» more  CORR 2002»
13 years 4 months ago
Evaluating the Effectiveness of Ensembles of Decision Trees in Disambiguating Senseval Lexical Samples
This paper presents an evaluation of an ensemble
Ted Pedersen
CORR
2002
Springer
97views Education» more  CORR 2002»
13 years 4 months ago
Bootstrapping Lexical Choice via Multiple-Sequence Alignment
An important component of any generation system is the mapping dictionary, a lexicon of elementary semantic expressions and corresponding natural language realizations. Typically,...
Regina Barzilay, Lillian Lee
CORR
2002
Springer
126views Education» more  CORR 2002»
13 years 4 months ago
Unsupervised Discovery of Morphemes
We present two methods for unsupervised segmentation of words into morphemelike units. The model utilized is especially suited for languages with a rich morphology, such as Finnis...
Mathias Creutz, Krista Lagus
CORR
2002
Springer
93views Education» more  CORR 2002»
13 years 4 months ago
Ellogon: A New Text Engineering Platform
This paper presents Ellogon, a multi-lingual, cross-platform, general-purpose text engineering environment. Ellogon was designed in order to aid both researchers in natural langua...
Georgios Petasis, Vangelis Karkaletsis, Georgios P...
CORR
2002
Springer
90views Education» more  CORR 2002»
13 years 4 months ago
Mostly-Unsupervised Statistical Segmentation of Japanese Kanji Sequences
Given the lack of word delimiters in written Japanese, word segmentation is generally considered a crucial first step in processing Japanese texts. Typical Japanese segmentation a...
Rie Kubota Ando, Lillian Lee