Sciweavers

103 search results - page 3 / 21
» Using Data Mining Techniques to Learn Layouts of Flat-File B...
ICCS
2003
Springer
13 years 10 months ago
A Compress-Based Association Mining Algorithm for Large Dataset
Association mining is one of the primary sub-areas in the field of data mining. This technique has been used in numerous practical applications, including consumer market baske...
Mafruz Zaman Ashrafi, David Taniar, Kate A. Smith
KDD
2002
ACM
14 years 5 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
ICPR
2006
IEEE
14 years 5 months ago
Finding Rule Groups to Classify High Dimensional Gene Expression Datasets
Microarray data provides quantitative information about the transcription profile of cells. To analyze microarray datasets, machine learning methodology has increasingly attrac...
Jiyuan An, Yi-Ping Phoebe Chen
AIMSA
2004
Springer
13 years 10 months ago
PubMiner: Machine Learning-Based Text Mining System for Biomedical Information Mining
PubMiner, an intelligent machine-learning-based text mining system for mining biological information from the literature, is introduced. PubMiner utilizes natural language processing...
Jae-Hong Eom, Byoung-Tak Zhang
KDD
1998
ACM
13 years 9 months ago
Large Datasets Lead to Overly Complex Models: An Explanation and a Solution
This paper explores unexpected results that lie at the intersection of two common themes in the KDD community: large datasets and the goal of building compact models. Experiments ...
Tim Oates, David Jensen