Sciweavers

EMNLP
2004
13 years 6 months ago
Error Measures and Bayes Decision Rules Revisited with Applications to POS Tagging
Starting from first principles, we re-visit the statistical approach and study two forms of the Bayes decision rule: the common rule for minimizing the number of string errors and...
Hermann Ney, Maja Popovic, David Sündermann
EMNLP
2004
13 years 6 months ago
From Machine Translation to Computer Assisted Translation using Finite-State Models
State-of-the-art machine translation techniques are still far from producing high quality translations. This drawback leads us to introduce an alternative approach to the translat...
Jorge Civera, Elsa Cubel, Antonio L. Lagarda, Davi...
EMNLP
2004
13 years 6 months ago
Efficient Decoding for Statistical Machine Translation with a Fully Expanded WFST Model
This paper proposes a novel method to compile statistical models for machine translation to achieve efficient decoding. In our method, each statistical submodel is represented by ...
Hajime Tsukada, Masaaki Nagata
EMNLP
2004
13 years 6 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
EMNLP
2004
13 years 6 months ago
Applying Conditional Random Fields to Japanese Morphological Analysis
This paper presents Japanese morphological analysis based on conditional random fields (CRFs). Previous work in CRFs assumed that observation sequence (word) boundaries were fixed...
Taku Kudo, Kaoru Yamamoto, Yuji Matsumoto
EMNLP
2004
13 years 6 months ago
Instance-Based Question Answering: A Data-Driven Approach
Anticipating the availability of large questionanswer datasets, we propose a principled, datadriven Instance-Based approach to Question Answering. Most question answering systems ...
Lucian Vlad Lita, Jaime G. Carbonell
EMNLP
2007
13 years 6 months ago
What Can Syntax-Based MT Learn from Phrase-Based MT?
We compare and contrast the strengths and weaknesses of a syntax-based machine translation model with a phrase-based machine translation model on several levels. We briefly descr...
Steve DeNeefe, Kevin Knight, Wei Wang 0006, Daniel...
EMNLP
2007
13 years 6 months ago
Smooth Bilingual N-Gram Translation
We address the problem of smoothing translation probabilities in a bilingual N-grambased statistical machine translation system. It is proposed to project the bilingual tuples ont...
Holger Schwenk, Marta R. Costa-Jussà, Jos&e...
EMNLP
2007
13 years 6 months ago
Exploiting Wikipedia as External Knowledge for Named Entity Recognition
We explore the use of Wikipedia as external knowledge to improve named entity recognition (NER). Our method retrieves the corresponding Wikipedia entry for each candidate word seq...
Jun'ichi Kazama, Kentaro Torisawa
EMNLP
2007
13 years 6 months ago
Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features
This paper discusses automatic determination of case in Arabic. This task is an important part and major source of errors in full diacritization of Arabic. We use a goldstandard s...
Nizar Habash, Ryan Gabbard, Owen Rambow, Seth Kuli...