Sciweavers

385 search results - page 24 / 77
» Improving data mining utility with projective sampling
Sort
View
ICMLA
2004
14 years 11 months ago
Reducing complexity of rule based models via meta mining
Complexity, or in other words compactness, of models generated by rule learners is one of often neglected issues, although it has a profound effect on the success of any project t...
Lukasz A. Kurgan
101
Voted
KDD
2004
ACM
302views Data Mining» more  KDD 2004»
15 years 10 months ago
Redundancy based feature selection for microarray data
In gene expression microarray data analysis, selecting a small number of discriminative genes from thousands of genes is an important problem for accurate classification of diseas...
Lei Yu, Huan Liu
PAKDD
2004
ACM
127views Data Mining» more  PAKDD 2004»
15 years 3 months ago
Separating Structure from Interestingness
Condensed representations of pattern collections have been recognized to be important building blocks of inductive databases, a promising theoretical framework for data mining, and...
Taneli Mielikäinen
83
Voted
ICDM
2007
IEEE
133views Data Mining» more  ICDM 2007»
15 years 4 months ago
Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval
Most topic models, such as latent Dirichlet allocation, rely on the bag-of-words assumption. However, word order and phrases are often critical to capturing the meaning of text in...
Xuerui Wang, Andrew McCallum, Xing Wei
ICDM
2010
IEEE
147views Data Mining» more  ICDM 2010»
14 years 7 months ago
Location and Scatter Matching for Dataset Shift in Text Mining
Dataset shift from the training data in a source domain to the data in a target domain poses a great challenge for many statistical learning methods. Most algorithms can be viewed ...
Bo Chen, Wai Lam, Ivor W. Tsang, Tak-Lam Wong