Sciweavers

3961 search results - page 608 / 793
» Algorithmic Statistics
Sort
View
ICML
2009
IEEE
16 years 1 months ago
Independent factor topic models
Topic models such as Latent Dirichlet Allocation (LDA) and Correlated Topic Model (CTM) have recently emerged as powerful statistical tools for text document modeling. In this pap...
Duangmanee Putthividhya, Hagai Thomas Attias, Srik...
ICML
2009
IEEE
16 years 1 months ago
Sparse higher order conditional random fields for improved sequence labeling
In real sequence labeling tasks, statistics of many higher order features are not sufficient due to the training data sparseness, very few of them are useful. We describe Sparse H...
Xian Qian, Xiaoqian Jiang, Qi Zhang, Xuanjing Huan...
WSDM
2009
ACM
172views Data Mining» more  WSDM 2009»
16 years 1 months ago
Clustering the tagged web
Automatically clustering web pages into semantic groups promises improved search and browsing on the web. In this paper, we demonstrate how user-generated tags from largescale soc...
Daniel Ramage, Paul Heymann, Christopher D. Mannin...
WWW
2009
ACM
16 years 1 months ago
Threshold selection for web-page classification with highly skewed class distribution
We propose a novel cost-efficient approach to threshold selection for binary web-page classification problems with imbalanced class distributions. In many binary-classification ta...
Xiaofeng He, Lei Duan, Yiping Zhou, Byron Dom
CSO
2009
IEEE
16 years 1 months ago
Association Rules Based Data Mining on Test Data of Physical Health Standard
With the development of modern electronic and computer technologies, sports training and competition became more and more technical. A great deal of data were recorded, including ...
Lan Yu