Sciweavers

EMNLP
2004
13 years 6 months ago
Unsupervised Domain Relevance Estimation for Word Sense Disambiguation
This paper presents Domain Relevance Estimation (DRE), a fully unsupervised text categorization technique based on the statistical estimation of the relevance of a text with respe...
Alfio Massimiliano Gliozzo, Bernardo Magnini, Carl...
EMNLP
2004
13 years 6 months ago
Bilingual Parsing with Factored Estimation: Using English to Parse Korean
We describe how simple, commonly understood statistical models, such as statistical dependency parsers, probabilistic context-free grammars, and word-to-word translation models, c...
David A. Smith, Noah A. Smith
EMNLP
2004
13 years 6 months ago
The Entropy Rate Principle as a Predictor of Processing Effort: An Evaluation against Eye-tracking Data
This paper provides evidence for Genzel and Charniak's (2002) entropy rate principle, which predicts that the entropy of a sentence increases with its position in the text. W...
Frank Keller
EMNLP
2004
13 years 6 months ago
Learning Hebrew Roots: Machine Learning with Linguistic Constraints
The morphology of Semitic languages is unique in the sense that the major word-formation mechanism is an inherently non-concatenative process of interdigitation, whereby two morph...
Ezra Daya, Dan Roth, Shuly Wintner
EMNLP
2004
13 years 6 months ago
Object-Extraction and Question-Parsing using CCG
Accurate dependency recovery has recently been reported for a number of wide-coverage statistical parsers using Combinatory Categorial Grammar (CCG). However, overall figures give...
Stephen Clark, Mark Steedman, James R. Curran
EMNLP
2004
13 years 6 months ago
Evaluating Information Content by Factoid Analysis: Human annotation and stability
We present a new approach to intrinsic summary evaluation, based on initial experiments in van Halteren and Teufel (2003), which combines two novel aspects: comparison of informat...
Simone Teufel, Hans van Halteren
EMNLP
2004
13 years 6 months ago
A Boosting Algorithm for Classification of Semi-Structured Text
The focus of research in text classification has expanded from simple topic identification to more challenging tasks such as opinion/modality identification. Unfortunately, the la...
Taku Kudo, Yuji Matsumoto
EMNLP
2004
13 years 6 months ago
Scaling Web-based Acquisition of Entailment Relations
Paraphrase recognition is a critical step for natural language interpretation. Accordingly, many NLP applications would benefit from high coverage knowledge bases of paraphrases. ...
Idan Szpektor, Hristo Tanev, Ido Dagan, Bonaventur...
EMNLP
2004
13 years 6 months ago
Max-Margin Parsing
We present a novel discriminative approach to parsing inspired by the large-margin criterion underlying support vector machines. Our formulation uses a factorization analogous to ...
Ben Taskar, Dan Klein, Mike Collins, Daphne Koller...