Sciweavers

EMNLP
2010
Discriminative Sample Selection for Statistical Machine Translation
Production of parallel training corpora for the development of statistical machine translation (SMT) systems for resource-poor languages usually requires extensive manual effort. ...
Sankaranarayanan Ananthakrishnan, Rohit Prasad, Da...
EMNLP
2010
Efficient Graph-Based Semi-Supervised Learning of Structured Tagging Models
We describe a new scalable algorithm for semi-supervised training of conditional random fields (CRF) and its application to part-of-speech (POS) tagging. The algorithm uses a simil...
Amarnag Subramanya, Slav Petrov, Fernando Pereira
EMNLP
2010
Two Decades of Unsupervised POS Induction: How Far Have We Come?
Part-of-speech (POS) induction is one of the most popular tasks in research on unsupervised NLP. Many different methods have been proposed, yet comparisons are difficult to make s...
Christos Christodoulopoulos, Sharon Goldwater, Mar...
EMNLP
2010
Unsupervised Discovery of Negative Categories in Lexicon Bootstrapping
Multi-category bootstrapping algorithms were developed to reduce semantic drift. By extracting multiple semantic lexicons simultaneously, a category's search space may be res...
Tara McIntosh
EMNLP
2010
Automatic Detection and Classification of Social Events
In this paper we introduce the new task of social event extraction from text. We distinguish two broad types of social events depending on whether only one or both parties are awa...
Apoorv Agarwal, Owen Rambow
EMNLP
2010
Improving Translation via Targeted Paraphrasing
Targeted paraphrasing is a new approach to the problem of obtaining cost-effective, reasonable quality translation that makes use of simple and inexpensive human computations by m...
Philip Resnik, Olivia Buzek, Chang Hu, Yakov Kronr...
EMNLP
2010
A Fast Fertility Hidden Markov Model for Word Alignment Using MCMC
A word in one language can be translated to zero, one, or several words in other languages. Using word fertility features has been shown to be useful in building word alignment mo...
Shaojun Zhao, Daniel Gildea
EMNLP
2010
Crouching Dirichlet, Hidden Markov Model: Unsupervised POS Tagging with Context Local Tag Generation
We define the crouching Dirichlet, hidden Markov model (CDHMM), an HMM for part-of-speech tagging which draws state prior distributions for each local document context. This simple...
Taesun Moon, Katrin Erk, Jason Baldridge
EMNLP
2010
Unsupervised Parse Selection for HPSG
Parser disambiguation with precision grammars generally takes place via statistical ranking of the parse yield of the grammar using a supervised parse selection model. In the stan...
Rebecca Dridan, Timothy Baldwin