Sciweavers

EMNLP
2007
13 years 5 months ago
Improving Word Sense Disambiguation Using Topic Features
This paper presents a novel approach for exploiting the global context for the task of word sense disambiguation (WSD). This is done by using topic features constructed using the ...
Junfu Cai, Wee Sun Lee, Yee Whye Teh
ACL
2008
13 years 5 months ago
Text Segmentation with LDA-Based Fisher Kernel
In this paper we propose a domain-independent text segmentation method, which consists of three components. Latent Dirichlet allocation (LDA) is employed to compute words semantic ...
Qi Sun, Runxin Li, Dingsheng Luo, Xihong Wu
DAGM
2008
Springer
13 years 6 months ago
Comparing Local Feature Descriptors in pLSA-Based Image Models
Abstract. Probabilistic models with hidden variables such as probabilistic Latent Semantic Analysis (pLSA) and Latent Dirichlet Allocation (LDA) have recently become popular for so...
Eva Hörster, Thomas Greif, Rainer Lienhart, M...
PAKDD
2010
ACM
13 years 8 months ago
Computation of Ratios of Secure Summations in Multi-party Privacy-Preserving Latent Dirichlet Allocation
In this paper, we focus our attention on the problem of computing the ratio of two numbers, both of which are the summations of the private numbers distributed in different parties...
Bin Yang, Hiroshi Nakagawa
KDD
2010
ACM
13 years 8 months ago
Topic models with power-law using Pitman-Yor process
One of the important approaches for knowledge discovery and data mining is to estimate unobserved variables, because latent variables can indicate hidden and specific properties o...
Issei Sato, Hiroshi Nakagawa
AAIM
2009
Springer
13 years 9 months ago
PLDA: Parallel Latent Dirichlet Allocation for Large-Scale Applications
Abstract. This paper presents PLDA, our parallel implementation of Latent Dirichlet Allocation on MPI and MapReduce. PLDA smooths out storage and computation bottlenecks and provid...
Yi Wang, Hongjie Bai, Matt Stanton, Wen-Yen Chen, ...
IRFC
2010
Springer
13 years 9 months ago
Combining Wikipedia-Based Concept Models for Cross-Language Retrieval
Abstract. As a low-cost resource that is up-to-date, Wikipedia has recently gained attention as a means to provide cross-language bridging for information retrieval. Contradictory to a...
Benjamin Roth, Dietrich Klakow
SIGIR
2003
ACM
13 years 9 months ago
On an equivalence between PLSI and LDA
Latent Dirichlet Allocation (LDA) is a fully generative approach to language modelling which overcomes the inconsistent generative semantics of Probabilistic Latent Semantic Index...
Mark Girolami, Ata Kabán
SLSFS
2005
Springer
13 years 9 months ago
Discrete Component Analysis
Abstract. This article presents a unified theory for analysis of components in discrete data, and compares the methods with techniques such as independent component analysis, non-...
Wray L. Buntine, Aleks Jakulin
SIGIR
2006
ACM
13 years 10 months ago
LDA-based document models for ad-hoc retrieval
Search algorithms incorporating some form of topic model have a long history in information retrieval. For example, cluster-based retrieval has been studied since the 60s and has ...
Xing Wei, W. Bruce Croft