Sciweavers

602 search results - page 19 / 121
» Integrating Data and Probabilistically Structured Text Docum...
NAACL
2007
14 years 11 months ago
Multilingual Structural Projection across Interlinear Text
This paper explores the potential for annotating and enriching data for low-density languages via the alignment and projection of syntactic structure from parsed data for resource...
Fei Xia, William Lewis
ICDAR
2011
IEEE
13 years 9 months ago
Aletheia - An Advanced Document Layout and Text Ground-Truthing System for Production Environments
Large-scale digitisation has led to a number of new possibilities with regard to adaptive and learning based methods in the field of Document Image Analysis and OCR. For ground t...
C. Clausner, Stefan Pletschacher, Apostolos Antona...
SIGMOD
2001
ACM
15 years 9 months ago
Automatic Segmentation of Text into Structured Records
In this paper we present a method for automatically segmenting unformatted text records into structured elements. Several useful data sources today are human-generated as continuo...
Vinayak R. Borkar, Kaustubh Deshmukh, Sunita Saraw...
RIAO
2007
14 years 11 months ago
Comprehensible and Accurate Cluster Labels in Text Clustering
The purpose of text clustering in information retrieval is to discover groups of semantically related documents. Accurate and comprehensible cluster descriptions (labels) let the ...
Jerzy Stefanowski, Dawid Weiss
ESA
2010
Springer
14 years 10 months ago
Top-k Ranked Document Search in General Text Databases
Text search engines return a set of k documents ranked by similarity to a query. Typically, documents and queries are drawn from natural language text, which can readily be partiti...
J. Shane Culpepper, Gonzalo Navarro, Simon J. Pugl...