Sciweavers

374 search results - page 12 / 75
» Modeling Chinese Documents with Topical Word-Character Model...
Sort
View
NIPS
2000
15 years 1 months ago
The Missing Link - A Probabilistic Model of Document Content and Hypertext Connectivity
We describe a joint probabilistic model for modeling the contents and inter-connectivity of document collections such as sets of web pages or research paper archives. The model is...
David A. Cohn, Thomas Hofmann
98
Voted
SIGIR
2006
ACM
15 years 5 months ago
LDA-based document models for ad-hoc retrieval
Search algorithms incorporating some form of topic model have a long history in information retrieval. For example, cluster-based retrieval has been studied since the 60s and has ...
Xing Wei, W. Bruce Croft
ICML
2007
IEEE
16 years 15 days ago
Mixtures of hierarchical topics with Pachinko allocation
The four-level pachinko allocation model (PAM) (Li & McCallum, 2006) represents correlations among topics using a DAG structure. It does not, however, represent a nested hiera...
David M. Mimno, Wei Li, Andrew McCallum
ACML
2009
Springer
15 years 6 months ago
Injecting Structured Data to Generative Topic Model in Enterprise Settings
Enterprises have accumulated both structured and unstructured data steadily as computing resources improve. However, previous research on enterprise data mining often treats these ...
Han Xiao, Xiaojie Wang, Chao Du
93
Voted
ACL
2012
13 years 2 months ago
Authorship Attribution with Author-aware Topic Models
Authorship attribution deals with identifying the authors of anonymous texts. Building on our earlier finding that the Latent Dirichlet Allocation (LDA) topic model can be used t...
Yanir Seroussi, Fabian Bohnert, Ingrid Zukerman