Sciweavers

1211 search results - page 45 / 243
» Topics in 0--1 data
Sort
View
NAACL
2003
15 years 1 months ago
Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures
Sources of training data suitable for language modeling of conversational speech are limited. In this paper, we show how training data can be supplemented with text from the web ï...
Ivan Bulyko, Mari Ostendorf, Andreas Stolcke
WOA
2010
14 years 9 months ago
Classification of Whereabouts Patterns From Large-scale Mobility Data
Classification of users' whereabouts patterns is important for many emerging ubiquitous computing applications. Latent Dirichlet Allocation (LDA) is a powerful mechanism to e...
Laura Ferrari, Marco Mamei
RIAO
2004
15 years 1 months ago
Mining Textual Data through Term Variant Clustering : the TermWatch system
We present a system for mapping the structure of research topics in a corpus. TermWatch portrays the "aboutness" of a corpus of scientific and technical publications by ...
Fidelia Ibekwe-Sanjuan, Eric SanJuan
KDD
2004
ACM
210views Data Mining» more  KDD 2004»
16 years 8 days ago
Probabilistic author-topic models for information discovery
We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...
KDD
2007
ACM
122views Data Mining» more  KDD 2007»
16 years 8 days ago
Expertise modeling for matching papers with reviewers
An essential part of an expert-finding task, such as matching reviewers to submitted papers, is the ability to model the expertise of a person based on documents. We evaluate seve...
David M. Mimno, Andrew McCallum