Sciweavers

CICLING
2007
Springer
13 years 10 months ago
A Competitive Term Selection Method for Information Retrieval
Term selection process is a very necessary component for most natural language processing tasks. Although different unsupervised techniques have been proposed, the best results ar...
Franco Rojas López, Héctor Jim&eacut...
CICLING
2007
Springer
13 years 10 months ago
Text Categorization for Improved Priors of Word Meaning
Distributions of the senses of words are often highly skewed. This fact is exploited by word sense disambiguation (WSD) systems which back off to the predominant (most frequent) s...
Rob Koeling, Diana McCarthy, John Carroll
CICLING
2007
Springer
13 years 10 months ago
Latent Variable Models for Causal Knowledge Acquisition
Takashi Inui, Hiroya Takamura, Manabu Okumura
CICLING
2007
Springer
13 years 10 months ago
Baby-Steps Towards Building a Spanglish Language Model
Abstract. Spanglish is the simultaneous use, or alternating of both, traditional Spanish and English within the same conversational event. This interlanguage is commonly used in U....
Juan Carlos Franco, Thamar Solorio
CICLING
2007
Springer
13 years 10 months ago
Dependency Analysis and CBR to Bridge the Generation Gap in Template-Based NLG
The present paper describes how dependency analysis can be used to automatically extract from a corpus a set of cases - and an accompanying vocabulary - which enable a template-bas...
Virginia Francisco, Raquel Hervás, Pablo Ge...
CICLING
2007
Springer
13 years 10 months ago
On the Impact of Lexical and Linguistic Features in Genre- and Domain-Based Categorization
Abstract. Classification in genres and domains is a major field of research for Information Retrieval (scientific and technical watch, datamining, etc.) and the selection of app...
Guillaume Cleuziou, Céline Poudat
CICLING
2007
Springer
13 years 10 months ago
NEO-CORTEX: A Performant User-Oriented Multi-Document Summarization System
Abstract. This paper discusses an approach to topic-oriented multidocument summarization. It investigates the effectiveness of using additional information about the document set ...
Florian Boudin, Juan Manuel Torres Moreno
CICLING
2007
Springer
13 years 10 months ago
Enhancing Cross-Language Question Answering by Combining Multiple Question Translations
One major problem of state-of-the-art Cross Language Question Answering systems is the translation of user questions. This paper proposes combining the potential of multiple transl...
Rita M. Aceves-Pérez, Manuel Montes-y-G&oac...
CICLING
2007
Springer
13 years 10 months ago
Adapting the JIRS Passage Retrieval System to the Arabic Language
The need of having a Passage Retrieval (PR) system for Arabic texts is due essentially to our aim to build an Arabic Question Answering (QA) system in our research team. We have ch...
Yassine Benajiba, Paolo Rosso, José Manuel ...
CICLING
2007
Springer
13 years 10 months ago
ANERsys: An Arabic Named Entity Recognition System Based on Maximum Entropy
Abstract. The task of Named Entity Recognition (NER) allows to identify proper names as well as temporal and numeric expressions, in an open-domain text. NER systems proved to be v...
Yassine Benajiba, Paolo Rosso, José-Miguel ...