Sciweavers

EMNLP
2011
12 years 4 months ago
Approximate Scalable Bounded Space Sketch for Large Data NLP
We exploit sketch techniques, especially the Count-Min sketch, a memory, and time efficient framework which approximates the frequency of a word pair in the corpus without explic...
Amit Goyal, Hal Daumé III
EMNLP
2011
12 years 4 months ago
Class Label Enhancement via Related Instances
Class-instance label propagation algorithms have been successfully used to fuse information from multiple sources in order to enrich a set of unlabeled instances with class labels...
Zornitsa Kozareva, Konstantin Voevodski, Shang-Hua...
EMNLP
2011
12 years 4 months ago
Predicting Thread Discourse Structure over Technical Web Forums
Li Wang, Marco Lui, Su Nam Kim, Joakim Nivre, Timo...
EMNLP
2011
12 years 4 months ago
Improved Transliteration Mining Using Graph Reinforcement
Ali El Kahki, Kareem Darwish, Ahmed Saad El Din, M...
EMNLP
2011
12 years 4 months ago
Unsupervised Dependency Parsing without Gold Part-of-Speech Tags
We show that categories induced by unsupervised word clustering can surpass the performance of gold part-of-speech tags in dependency grammar induction. Unlike classic clustering ...
Valentin I. Spitkovsky, Hiyan Alshawi, Angel X. Ch...
EMNLP
2011
12 years 4 months ago
Lateen EM: Unsupervised Training with Multiple Objectives, Applied to Dependency Grammar Induction
We present new training methods that aim to mitigate local optima and slow convergence in unsupervised training by using additional imperfect objectives. In its simplest form, lat...
Valentin I. Spitkovsky, Hiyan Alshawi, Daniel Jura...