Sciweavers

1396 search results - page 24 / 280
» Polylingual Topic Models
Sort
View
CORR
2004
Springer
128views Education» more  CORR 2004»
14 years 11 months ago
Unsupervised Topic Adaptation for Lecture Speech Retrieval
We are developing a cross-media information retrieval system, in which users can view specific segments of lecture videos by submitting text queries. To produce a text index, the ...
Atsushi Fujii, Katunobu Itou, Tomoyosi Akiba, Tets...
NIPS
2001
15 years 1 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
EMNLP
2010
14 years 9 months ago
Exploiting Conversation Structure in Unsupervised Topic Segmentation for Emails
This work concerns automatic topic segmentation of email conversations. We present a corpus of email threads manually annotated with topics, and evaluate annotator reliability. To...
Shafiq R. Joty, Giuseppe Carenini, Gabriel Murray,...
SIGIR
2002
ACM
14 years 11 months ago
A critical examination of TDT's cost function
Topic Detection and Tracking (TDT) tasks are evaluated using a cost function. The standard TDT cost function assumes a constant probability of relevance P(rel) across all topics. ...
R. Manmatha, Ao Feng, James Allan
CIKM
2010
Springer
14 years 10 months ago
Decomposing background topics from keywords by principal component pursuit
Low-dimensional topic models have been proven very useful for modeling a large corpus of documents that share a relatively small number of topics. Dimensionality reduction tools s...
Kerui Min, Zhengdong Zhang, John Wright, Yi Ma