Sciweavers

COLING
2010
13 years 8 days ago
Tree Topological Features for Unlexicalized Parsing
As unlexicalized parsing lacks word token information, it is important to investigate novel parsing features to improve the accuracy. This paper studies a set of tree topological ...
Samuel W. K. Chan, Lawrence Y. L. Cheung, Mickey W...
COLING
2010
13 years 8 days ago
Learning Web Query Patterns for Imitating Wikipedia Articles
This paper presents a novel method for acquiring a set of query patterns to retrieve documents containing important information about an entity. Given an existing Wikipedia catego...
Shohei Tanaka, Naoaki Okazaki, Mitsuru Ishizuka
COLING
2010
13 years 8 days ago
The Role of Queries in Ranking Labeled Instances Extracted from Text
A weakly supervised method uses anonymized search queries to induce a ranking among class labels extracted from unstructured text for various instances. The accuracy of the extrac...
Marius Pasca
COLING
2010
13 years 8 days ago
Linguistic Cues for Distinguishing Literal and Non-Literal Usages
We investigate the effectiveness of different linguistic cues for distinguishing literal and non-literal usages of potentially idiomatic expressions. We focus specifically on feat...
Linlin Li, Caroline Sporleder
COLING
2010
13 years 8 days ago
Benchmarking for syntax-based sentential inference
We propose a methodology for investigating how well NLP systems handle meaning preserving syntactic variations. We start by presenting a method for the semi automated creation of ...
Paul Bédaride, Claire Gardent
COLING
2010
13 years 8 days ago
Verbs are where all the action lies: Experiences of Shallow Parsing of a Morphologically Rich Language
Verb suffixes and verb complexes of morphologically rich languages carry a lot of information. We show that this information if harnessed for the task of shallow parsing can lead ...
Harshada Gune, Mugdha Bapat, Mitesh M. Khapra, Pus...
COLING
2010
13 years 8 days ago
Challenges from Information Extraction to Information Fusion
Information Extraction (IE) technology is facing new challenges of dealing with large-scale heterogeneous data sources from different documents, languages and modalities. Informat...
Heng Ji
COLING
2010
13 years 8 days ago
Unsupervised cleansing of noisy text
In this paper we look at the problem of cleansing noisy text using a statistical machine translation model. Noisy text is produced in informal communications such as Short Message...
Danish Contractor, Tanveer A. Faruquie, L. Venkata...
COLING
2010
13 years 8 days ago
Heterogeneous Parsing via Collaborative Decoding
There often exist multiple corpora for the same natural language processing (NLP) tasks. However, such corpora are generally used independently due to distinctions in annotation s...
Muhua Zhu, Jingbo Zhu, Tong Xiao