Sciweavers

137
Voted
ACL
2012
13 years 7 months ago
Native Language Detection with Tree Substitution Grammars
We investigate the potential of Tree Substitution Grammars as a source of features for native language detection, the task of inferring an author’s native language from text in ...
Benjamin Swanson, Eugene Charniak
ACL
2012
13 years 7 months ago
CSNIPER - Annotation-by-query for Non-canonical Constructions in Large Corpora
We present CSNIPER (Corpus Sniper), a tool that implements (i) a web-based multiuser scenario for identifying and annotating non-canonical grammatical constructions in large corpo...
Richard Eckart de Castilho, Sabine Bartsch, Iryna ...
151
Voted
ACL
2012
13 years 7 months ago
Joint Learning of a Dual SMT System for Paraphrase Generation
SMT has been used in paraphrase generation by translating a source sentence into another (pivot) language and then back into the source. The resulting sentences can be used as can...
Hong Sun, Ming Zhou
ACL
2012
13 years 7 months ago
A Novel Burst-based Text Representation Model for Scalable Event Detection
Mining retrospective events from text streams has been an important research topic. Classic text representation model (i.e., vector space model) cannot model temporal aspects of d...
Xin Zhao, Rishan Chen, Kai Fan, Hongfei Yan, Xiaom...
161
Voted
ACL
2012
13 years 7 months ago
Semantic Parsing with Bayesian Tree Transducers
Many semantic parsing models use tree transformations to map between natural language and meaning representation. However, while tree transformations are central to several state-...
Bevan K. Jones, Mark Johnson, Sharon Goldwater
ACL
2012
13 years 7 months ago
Syntactic Annotations for the Google Books NGram Corpus
We present a new edition of the Google Books Ngram Corpus, which describes how often words and phrases were used over a period of five centuries, in eight languages; it reflects...
Yuri Lin, Jean-Baptiste Michel, Erez Aiden Lieberm...
98
Voted
ACL
2012
13 years 7 months ago
Multilingual WSD with Just a Few Lines of Code: the BabelNet API
Roberto Navigli, Simone Paolo Ponzetto
ACL
2012
13 years 7 months ago
Named Entity Disambiguation in Streaming Data
The named entity disambiguation task is to resolve the many-to-many correspondence between ambiguous names and the unique realworld entity. This task can be modeled as a classifi...
Alexandre Davis, Adriano Veloso, Altigran Soares d...
140
Voted
ACL
2012
13 years 7 months ago
Hierarchical Chunk-to-String Translation
We present a hierarchical chunk-to-string translation model, which can be seen as a compromise between the hierarchical phrasebased model and the tree-to-string model, to combine ...
Yang Feng, Dongdong Zhang, Mu Li, Qun Liu
ACL
2012
13 years 7 months ago
Learning to Find Translations and Transliterations on the Web
In this paper, we present a new method for learning to finding translations and transliterations on the Web for a given term. The approach involves using a small set of terms and ...
Joseph Z. Chang, Jason S. Chang, Jyh-Shing Roger J...