Sciweavers

579 search results - page 14 / 116
» Modeling word burstiness using the Dirichlet distribution
Sort
View
ICML
2006
IEEE
16 years 15 days ago
Pachinko allocation: DAG-structured mixture models of topic correlations
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not ca...
Wei Li, Andrew McCallum
CN
2006
126views more  CN 2006»
14 years 11 months ago
A measurement study of correlations of Internet flow characteristics
Previous studies of Internet traffic have shown that a very small percentage of flows consume most of the network bandwidth. It is important to understand the characteristics of s...
Kun-Chan Lan, John S. Heidemann
ICDE
2009
IEEE
130views Database» more  ICDE 2009»
16 years 1 months ago
A Latent Topic Model for Complete Entity Resolution
In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sh...
Liangcai Shu, Bo Long, Weiyi Meng
JCB
2002
74views more  JCB 2002»
14 years 11 months ago
Using Substitution Matrices to Estimate Probability Distributions for Biological Sequences
Accurately estimating probabilities from observations is important for probabilistic-based approaches to problems in computational biology. In this paper we present a biologically...
Eleazar Eskin, William Stafford Noble, Yoram Singe...
JMLR
2010
137views more  JMLR 2010»
14 years 6 months ago
Covariance in Unsupervised Learning of Probabilistic Grammars
Probabilistic grammars offer great flexibility in modeling discrete sequential data like natural language text. Their symbolic component is amenable to inspection by humans, while...
Shay B. Cohen, Noah A. Smith