Search Sciweavers | Sciweavers

139 search results - page 10 / 28

» An Empirical Comparison of Four Text Mining Methods

ICML
2003
IEEE

87 views, Machine Learning

Text Bundling: Statistics Based Data-Reduction

16 years 14 days ago

Download www.hpl.hp.com

As text corpora become larger, tradeoffs between speed and accuracy become critical: slow but accurate methods may not complete in a practical amount of time. In order to make the...

Lawrence Shih, Jason D. Rennie, Yu-Han Chang, Davi...

KDD
2009
ACM

191 views, Data Mining

Efficient methods for topic model inference on streaming document collections

16 years 6 days ago

Download www.cs.umass.edu

Topic models provide a powerful tool for analyzing large text collections by representing high dimensional data in a low dimensional subspace. Fitting a topic model given a set of...

Limin Yao, David M. Mimno, Andrew McCallum

ICDM
2007
IEEE

170 views, Data Mining

Consensus Clusterings

15 years 6 months ago

Download www.cs.cornell.edu

In this paper we address the problem of combining multiple clusterings without access to the underlying features of the data. This process is known in the literature as clustering...

Nam Nguyen, Rich Caruana

ICDM
2003
IEEE

181 views, Data Mining

Dynamic Weighted Majority: A New Ensemble Method for Tracking Concept Drift

15 years 5 months ago

Download www.stanford.edu

Algorithms for tracking concept drift are important for many applications. We present a general method based on the Weighted Majority algorithm for using any online learner for co...

Jeremy Z. Kolter, Marcus A. Maloof

DMIN
2007

226 views, Data Mining

Generative Oversampling for Mining Imbalanced Datasets

15 years 1 month ago

Download www.ideal.ece.utexas.edu

One way to handle data mining problems where class prior probabilities and/or misclassification costs between classes are highly unequal is to resample the data until a new, d...

Alexander Liu, Joydeep Ghosh, Cheryl Martin
