Sciweavers

Search results for: Integrating Data and Probabilistically Structured Text Docum...
ICDE
2010
IEEE
15 years 9 months ago
WikiAnalytics: Ad-hoc Querying of Highly Heterogeneous Structured Data
Searching and extracting meaningful information out of highly heterogeneous datasets is a hot topic that has received a lot of attention. However, the existing solutions are based on e...
Andrey Balmin, Emiran Curtmola
SIGIR
2000
ACM
15 years 1 months ago
On the design and evaluation of a multi-dimensional approach to information retrieval
We present a method of searching text collections that takes advantage of hierarchical information within documents and integrates searches of structured and unstructured data. W...
M. Catherine McCabe, Jinho Lee, Abdur Chowdhury, D...
CIKM
2008
Springer
14 years 11 months ago
Modeling hidden topics on document manifold
Topic modeling has been a key problem for document analysis. One of the canonical approaches for topic modeling is Probabilistic Latent Semantic Indexing, which maximizes the join...
Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai
AAAI
1998
14 years 10 months ago
Learning to Classify Text from Labeled and Unlabeled Documents
In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...
Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...
VLDB
1994
ACM
15 years 1 months ago
Integrating a Structured-Text Retrieval System with an Object-Oriented Database System
We describe the integration of a structured-text retrieval system (TextMachine) into an object-oriented database system (OpenODB). We use the external function capability of the da...
Tak W. Yan, Jurgen Annevelink