Sciweavers

1261 search results - page 112 / 253
» Extracting Text from PostScript
Sort
View
LREC
2008
97views Education» more  LREC 2008»
14 years 11 months ago
On Classifying Coherent/Incoherent Romanian Short Texts
In this paper we present and discuss the results of a text coherence experiment performed on a small corpus of Romanian text from a number of alternative high school manuals. Duri...
Anca Dinu
ACL
2003
14 years 11 months ago
An Ontology-based Semantic Tagger for IE system
In this paper, we present a method for the semantic tagging of word chunks extracted from a written transcription of conversations. This work is part of an ongoing project for an ...
Narjès Boufaden
PLDI
2010
ACM
15 years 7 months ago
A Context-free Markup Language for Semi-structured Text
An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...
Qian Xi, David Walker
SIGMOD
2008
ACM
134views Database» more  SIGMOD 2008»
15 years 10 months ago
SystemT: a system for declarative information extraction
As applications within and outside the enterprise encounter increasing volumes of unstructured data, there has been renewed interest in the area of information extraction (IE) ? t...
Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghava...
KDD
2004
ACM
160views Data Mining» more  KDD 2004»
15 years 10 months ago
Boosting for Text Classification with Semantic Features
Abstract. Current text classification systems typically use term stems for representing document content. Semantic Web technologies allow the usage of features on a higher semantic...
Stephan Bloehdorn, Andreas Hotho