Sciweavers

374 search results - page 16 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
Sort
View
CIKM
2008
Springer
15 years 1 months ago
Modeling hidden topics on document manifold
Topic modeling has been a key problem for document analysis. One of the canonical approaches for topic modeling is Probabilistic Latent Semantic Indexing, which maximizes the join...
Deng Cai, Qiaozhu Mei, Jiawei Han, Chengxiang Zhai
PAMI
2010
113views more  PAMI 2010»
14 years 10 months ago
Hierarchical Bayesian Modeling of Topics in Time-Stamped Documents
—We consider the problem of inferring and modeling topics in a sequence of documents with known publication dates. The documents at a given time are each characterized by a topic...
Iulian Pruteanu-Malinici, Lu Ren, John William Pai...
CVPR
2009
IEEE
15 years 3 months ago
Robust unsupervised segmentation of degraded document images with topic models
Segmentation of document images remains a challenging vision problem. Although document images have a structured layout, capturing enough of it for segmentation can be difficult....
Timothy J. Burns, Jason J. Corso
ICADL
2007
Springer
132views Education» more  ICADL 2007»
15 years 5 months ago
On Building a Full-Text Digital Library of Historical Documents
The National Taiwan University Library has built a digital library of historical documents about Taiwan. The content is unique in that it covers about 80% of all primary Chinese hi...
Szu-Pei Chen, Jieh Hsiang, Hsieh-Chang Tu, Micha W...
ACL
2009
14 years 9 months ago
Multi-Document Summarization using Sentence-based Topic Models
Dingding Wang, Shenghuo Zhu, Tao Li, Yihong Gong