Sciweavers

35 search results - page 3 / 7
» Dirichlet Enhanced Latent Semantic Analysis
Sort
View
HICSS
2010
IEEE
258views Biometrics» more  HICSS 2010»
13 years 5 months ago
An Empirical Comparison of Four Text Mining Methods
The amount of textual data that is available for researchers and businesses to analyze is increasing at a dramatic rate. This reality has led IS researchers to investigate various...
Sangno Lee, Jeff Baker, Jaeki Song, James C. Wethe...
WWW
2007
ACM
14 years 5 months ago
Generative models for name disambiguation
Name ambiguity is a special case of identity uncertainty where one person can be referenced by multiple name variations in different situations or even share the same name with ot...
Yang Song, Jian Huang 0002, Isaac G. Councill, Jia...
DEXA
2006
Springer
193views Database» more  DEXA 2006»
13 years 8 months ago
Understanding and Enhancing the Folding-In Method in Latent Semantic Indexing
Abstract. Latent Semantic Indexing(LSI) has been proved to be effective to capture the semantic structure of document collections. It is widely used in content-based text retrieval...
Xiang Wang 0002, Xiaoming Jin
CIKM
2009
Springer
13 years 11 months ago
Text segmentation via topic modeling: an analytical study
In this paper, the task of text segmentation is approached from a topic modeling perspective. We investigate the use of latent Dirichlet allocation (LDA) topic model to segment a ...
Hemant Misra, François Yvon, Joemon M. Jose...
JMLR
2006
138views more  JMLR 2006»
13 years 5 months ago
Noisy-OR Component Analysis and its Application to Link Analysis
We develop a new component analysis framework, the Noisy-Or Component Analyzer (NOCA), that targets high-dimensional binary data. NOCA is a probabilistic latent variable model tha...
Tomás Singliar, Milos Hauskrecht