Sciweavers

EACL
2006
ACL Anthology
13 years 6 months ago
Classifying Biological Full-Text Articles for Multi-Database Curation
In this paper, we propose an approach for identifying curatable articles from a large document set. This system considers three parts of an article (title ract, MeSH terms, and ca...
Wen-Juan Hou, Chih Lee, Hsin-Hsi Chen
EACL
2006
ACL Anthology
13 years 6 months ago
Automatically Constructing a Lexicon of Verb Phrase Idiomatic Combinations
We investigate the lexical and syntactic flexibility of a class of idiomatic expressions. We develop measures that draw on such linguistic properties, and demonstrate that these s...
Afsaneh Fazly, Suzanne Stevenson
EACL
2006
ACL Anthology
13 years 6 months ago
Improving Probabilistic Latent Semantic Analysis with Principal Component Analysis
Probabilistic Latent Semantic Analysis (PLSA) models have been shown to provide a better model for capturing polysemy and synonymy than Latent Semantic Analysis (LSA). However, th...
Ayman Farahat, Francine Chen
EACL
2006
ACL Anthology
13 years 6 months ago
Determining Term Subjectivity and Term Orientation for Opinion Mining
Opinion mining is a recent subdiscipline of computational linguistics which is concerned not with the topic a document is about, but with the opinion it expresses. To aid the extr...
Andrea Esuli, Fabrizio Sebastiani
EACL
2006
ACL Anthology
13 years 6 months ago
Recognizing Textual Parallelisms with Edit Distance and Similarity Degree
Detection of discourse structure is crucial in many text-based applications. This paper presents an original framework for describing textual parallelism which allows us to genera...
Marie Guégan, Nicolas Hernandez
EACL
2006
ACL Anthology
13 years 6 months ago
From Detecting Errors to Automatically Correcting Them
Faced with the problem of annotation errors in part-of-speech (POS) annotated corpora, we develop a method for automatically correcting such errors. Building on top of a successfu...
Markus Dickinson
EACL
2006
ACL Anthology
13 years 6 months ago
Information Presentation in Spoken Dialogue Systems
To tackle the problem of presenting a large number of options in spoken dialogue systems, we identify compelling options based on a model of user preferences, and present tradeoff...
Vera Demberg, Johanna D. Moore
EACL
2006
ACL Anthology
13 years 6 months ago
Automatic Acronym Recognition
This paper deals with the problem of recognizing and extracting acronymdefinition pairs in Swedish medical texts. This project applies a rule-based method to solve the acronym rec...
Dana Dannélls
EACL
2006
ACL Anthology
13 years 6 months ago
Esfinge a Question Answering System in the Web using the Web
Esfinge is a general domain Portuguese question answering system. It tries to take advantage of the great amount of information existent in the World Wide Web. Since Portuguese is...
Luís Fernando Costa
EACL
2006
ACL Anthology
13 years 6 months ago
A Figure of Merit for the Evaluation of Web-Corpus Randomness
In this paper, we present an automated, quantitative, knowledge-poor method to evaluate the randomness of a collection of documents (corpus), with respect to a number of biased pa...
Massimiliano Ciaramita, Marco Baroni