Sciweavers

602 search results - page 22 / 121
» Integrating Data and Probabilistically Structured Text Docum...
Sort
View
ICDE
2006
IEEE
428views Database» more  ICDE 2006»
15 years 10 months ago
Integrating Unstructured Data into Relational Databases
In this paper we present a system for automatically integrating unstructured text into a multi-relational database using state-of-the-art statistical models for structure extracti...
Imran R. Mansuri, Sunita Sarawagi
ICDAR
2003
IEEE
15 years 2 months ago
Discerning Structure from Freeform Handwritten Notes
This paper presents an integrated approach to parsing textual structure in freeform handwritten notes. Textgraphics classification and text layout analysis are classical problems ...
Michael Shilman, Zile Wei, Sashi Raghupathy, Patri...
SIGMOD
2007
ACM
144views Database» more  SIGMOD 2007»
15 years 9 months ago
The TopX DB&IR engine
This paper proposes a demo of the TopX search engine, an extensive framework for unified indexing, querying, and ranking of large collections of unstructured, semistructured, and ...
Martin Theobald, Ralf Schenkel, Gerhard Weikum
SIGIR
1999
ACM
15 years 1 months ago
Probabilistic Latent Semantic Indexing
Probabilistic Latent Semantic Indexing is a novel approach to automated document indexing which is based on a statistical latent class model for factor analysis of count data. Fit...
Thomas Hofmann
DMKD
2000
ACM
110views Data Mining» more  DMKD 2000»
15 years 1 months ago
Combining Strategies for Extracting Relations from Text Collections
Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
Eugene Agichtein, Eleazar Eskin, Luis Gravano