Sciweavers

374 search results - page 10 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
WIAMIS
2009
IEEE
15 years 6 months ago
Automatic topic detection strategy for information retrieval in spoken document
This paper suggests an alternative solution for the task of spoken document retrieval (SDR). The proposed system runs retrieval on multi-level transcriptions (word and phone) prod...
Shan Jin, Hemant Misra, Thomas Sikora, Joemon M. J...
SIGIR
2000
ACM
15 years 4 months ago
An investigation of linguistic features and clustering algorithms for topical document clustering
We investigate four hierarchical clustering methods (single-link, complete-link, groupwise-average, and single-pass) and two linguistically motivated text features (noun phrase he...
Vasileios Hatzivassiloglou, Luis Gravano, Ankineed...
WWW
2008
ACM
16 years 10 days ago
Automatic web image selection with a probabilistic latent topic model
We propose a new method to select images relevant to the given keywords from images gathered from the Web, based on the Probabilistic Latent Semantic Analysis (PLSA) model which is...
Keiji Yanai
ICDAR
2007
IEEE
15 years 6 months ago
Context-Sensitive Error Correction: Using Topic Models to Improve OCR
Modern optical character recognition software relies on human interaction to correct misrecognized characters. Even though the software often reliably identifies low-confidence ...
Michael L. Wick, Michael G. Ross, Erik G. Learned-...
AAAI
2010
14 years 9 months ago
A Topic Model for Linked Documents and Update Rules for its Estimation
The latent topic model plays an important role in unsupervised learning from a corpus, providing a probabilistic interpretation of the corpus in terms of the latent topic...
Zhen Guo, Shenghuo Zhu, Zhongfei Zhang, Yun Chi, Y...