Sciweavers

1396 search results - page 24 / 280
» Polylingual Topic Models
Sort
View
CORR
2004
Springer
128views Education» more  CORR 2004»
14 years 9 months ago
Unsupervised Topic Adaptation for Lecture Speech Retrieval
We are developing a cross-media information retrieval system, in which users can view specific segments of lecture videos by submitting text queries. To produce a text index, the ...
Atsushi Fujii, Katunobu Itou, Tomoyosi Akiba, Tets...
103
Voted
NIPS
2001
14 years 11 months ago
Latent Dirichlet Allocation
We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian m...
David M. Blei, Andrew Y. Ng, Michael I. Jordan
EMNLP
2010
14 years 7 months ago
Exploiting Conversation Structure in Unsupervised Topic Segmentation for Emails
This work concerns automatic topic segmentation of email conversations. We present a corpus of email threads manually annotated with topics, and evaluate annotator reliability. To...
Shafiq R. Joty, Giuseppe Carenini, Gabriel Murray,...
SIGIR
2002
ACM
14 years 9 months ago
A critical examination of TDT's cost function
Topic Detection and Tracking (TDT) tasks are evaluated using a cost function. The standard TDT cost function assumes a constant probability of relevance P(rel) across all topics. ...
R. Manmatha, Ao Feng, James Allan
CIKM
2010
Springer
14 years 8 months ago
Decomposing background topics from keywords by principal component pursuit
Low-dimensional topic models have been proven very useful for modeling a large corpus of documents that share a relatively small number of topics. Dimensionality reduction tools s...
Kerui Min, Zhengdong Zhang, John Wright, Yi Ma