Sciweavers

EMNLP
2004
13 years 5 months ago
Induction of Greedy Controllers for Deterministic Treebank Parsers
Most statistical parsers have used the grammar induction approach, in which a stochastic grammar is induced from a treebank. An alternative approach is to induce a controller for ...
Tom Kalt
EMNLP
2004
13 years 5 months ago
TextRank: Bringing Order into Text
In this paper, we introduce TextRank ...
Rada Mihalcea, Paul Tarau
EMNLP
2004
13 years 5 months ago
Statistical Significance Tests for Machine Translation Evaluation
If two translation systems differ in performance on a test set, can we trust that this indicates a difference in true system quality? To answer this question, we describe b...
Philipp Koehn
EMNLP
2004
13 years 5 months ago
Dependencies vs. Constituents for Tree-Based Alignment
Given a parallel parsed corpus, statistical tree-to-tree alignment attempts to match nodes in the syntactic trees for a given sentence in two languages. We train a probabilistic tr...
Daniel Gildea
EMNLP
2004
13 years 5 months ago
Error Measures and Bayes Decision Rules Revisited with Applications to POS Tagging
Starting from first principles, we re-visit the statistical approach and study two forms of the Bayes decision rule: the common rule for minimizing the number of string errors and...
Hermann Ney, Maja Popovic, David Sündermann
EMNLP
2004
13 years 5 months ago
From Machine Translation to Computer Assisted Translation using Finite-State Models
State-of-the-art machine translation techniques are still far from producing high quality translations. This drawback leads us to introduce an alternative approach to the translat...
Jorge Civera, Elsa Cubel, Antonio L. Lagarda, Davi...
EMNLP
2004
13 years 5 months ago
Efficient Decoding for Statistical Machine Translation with a Fully Expanded WFST Model
This paper proposes a novel method to compile statistical models for machine translation to achieve efficient decoding. In our method, each statistical submodel is represented by ...
Hajime Tsukada, Masaaki Nagata
EMNLP
2004
13 years 5 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
EMNLP
2004
13 years 5 months ago
Applying Conditional Random Fields to Japanese Morphological Analysis
This paper presents Japanese morphological analysis based on conditional random fields (CRFs). Previous work in CRFs assumed that observation sequence (word) boundaries were fixed...
Taku Kudo, Kaoru Yamamoto, Yuji Matsumoto
EMNLP
2004
13 years 5 months ago
Instance-Based Question Answering: A Data-Driven Approach
Anticipating the availability of large question-answer datasets, we propose a principled, data-driven Instance-Based approach to Question Answering. Most question answering systems ...
Lucian Vlad Lita, Jaime G. Carbonell