Sciweavers

374 search results - page 13 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
Sort
View
95
Voted
CIKM
2009
Springer
15 years 6 months ago
Cross-language linking of news stories on the web using interlingual topic modelling
We have studied the problem of linking event information across different languages without the use of translation systems or dictionaries. The linking is based on interlingua in...
Wim De Smet, Marie-Francine Moens
ECIR
2009
Springer
15 years 9 months ago
Topic and Trend Detection in Text Collections Using Latent Dirichlet Allocation
Algorithms that enable the process of automatically mining distinct topics in document collections have become increasingly important due to their applications in many fields and ...
Levent Bolelli, Seyda Ertekin, C. Lee Giles
ICDAR
2011
IEEE
13 years 11 months ago
Script-Free Text Line Segmentation Using Interline Space Model for Printed Document Images
—This paper proposes a model-based text line segmentation algorithm for machine-printed document images. The model is based on geometric configuration which uses the interline sp...
Minwoo Kim, Il-Seok Oh
ICASSP
2008
IEEE
15 years 6 months ago
A comparative study of probabilistic ranking models for spoken document summarization
The purpose of extractive document summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a tar...
Shih-Hsiang Lin, Yi-Ting Chen, Hsin-Min Wang, Bin ...
AIRS
2009
Springer
15 years 6 months ago
A Latent Dirichlet Framework for Relevance Modeling
Relevance-based language models operate by estimating the probabilities of observing words in documents relevant (or pseudo relevant) to a topic. However, these models assume that ...
Viet Ha-Thuc, Padmini Srinivasan