Sciweavers

1261 search results - page 113 / 253
» Extracting Text from PostScript
Sort
View
LREC
2008
70views Education» more  LREC 2008»
14 years 11 months ago
Process Model for Composing High-quality Text Corpora
The Teko corpus composing model offers a decentralized, dynamic way of collecting high-quality text corpora for linguistic research. The resulting corpus consists of independent t...
Mikko Lounela
ASP
2005
Springer
14 years 12 months ago
Exploiting ASP for Semantic Information Extraction
Abstract. The paper describes HıLεX, a new ASP-based system for the extraction of information from unstructured documents. Unlike previous systems, which are mainly syntactic, H...
Massimo Ruffolo, Nicola Leone, Marco Manna, Domeni...
IR
2006
14 years 10 months ago
Table extraction for answer retrieval
The ability to find tables and extract information from them is a necessary component of many information retrieval tasks. Documents often contain tables in order to communicate d...
Xing Wei, W. Bruce Croft, Andrew McCallum
ESANN
2007
14 years 11 months ago
Kernel PCA based clustering for inducing features in text categorization
We study dimensionality reduction or feature selection in text document categorization problem. We focus on the first step in building text categorization systems, that is the cho...
Zsolt Minier, Lehel Csató
EMNLP
2010
14 years 8 months ago
Summarizing Contrastive Viewpoints in Opinionated Text
This paper presents a two-stage approach to summarizing multiple contrastive viewpoints in opinionated text. In the first stage, we use an unsupervised probabilistic approach to m...
Michael Paul, ChengXiang Zhai, Roxana Girju