Sciweavers

EMNLP
2010
13 years 2 months ago
Improving Mention Detection Robustness to Noisy Input
Information-extraction (IE) research typically focuses on clean-text inputs. However, an IE engine serving real applications yields many false alarms due to less-well-formed input...
Radu Florian, John F. Pitrelli, Salim Roukos, Imed...
EMNLP
2010
13 years 2 months ago
Further Meta-Evaluation of Broad-Coverage Surface Realization
We present the first evaluation of the utility of automatic evaluation metrics on surface realizations of Penn Treebank data. Using outputs of the OpenCCG and XLE realizers, along...
Dominic Espinosa, Rajakrishnan Rajkumar, Michael W...
EMNLP
2010
13 years 2 months ago
Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval
We present three novel methods of compactly storing very large n-gram language models. These methods use substantially less space than all known approaches and allow n-gram probab...
David Guthrie, Mark Hepple
EMNLP
2010
13 years 2 months ago
Context Comparison of Bursty Events in Web Search and Online Media
In this paper, we conducted a systematic comparative analysis of language in different contexts of bursty topics, including web search, news media, blogging, and social bookmarkin...
Yunliang Jiang, Cindy Xide Lin, Qiaozhu Mei
EMNLP
2010
13 years 2 months ago
Mining Name Translations from Entity Graph Mapping
This paper studies the problem of mining entity translation, specifically, mining English and Chinese name pairs. Existing efforts can be categorized into (a) a transliterationbas...
Gae-won You, Seung-won Hwang, Young-In Song, Long ...
EMNLP
2010
13 years 2 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
EMNLP
2010
13 years 2 months ago
Discriminative Word Alignment with a Function Word Reordering Model
We address the modeling, parameter estimation and search challenges that arise from the
Hendra Setiawan, Christopher Dyer, Philip Resnik
EMNLP
2010
13 years 2 months ago
It Depends on the Translation: Unsupervised Dependency Parsing via Word Alignment
We reveal a previously unnoticed connection between dependency parsing and statistical machine translation (SMT), by formulating the dependency parsing task as a problem of word a...
Samuel Brody
EMNLP
2010
13 years 2 months ago
Learning the Relative Usefulness of Questions in Community QA
We present a machine learning approach for the task of ranking previously answered questions in a question repository with respect to their relevance to a new, unanswered referenc...
Razvan C. Bunescu, Yunfeng Huang
EMNLP
2010
13 years 2 months ago
Latent-Descriptor Clustering for Unsupervised POS Induction
We present a novel approach to distributionalonly, fully unsupervised, POS tagging, based on an adaptation of the EM algorithm for the estimation of a Gaussian mixture. In this ap...
Michael Lamar, Yariv Maron, Elie Bienenstock