Sciweavers

2227 search results - page 239 / 446
» Graph Mining based on a Data Partitioning Approach
Sort
View
123
Voted
KDD
2004
ACM
195views Data Mining» more  KDD 2004»
16 years 4 months ago
Improved robustness of signature-based near-replica detection via lexicon randomization
Detection of near duplicate documents is an important problem in many data mining and information filtering applications. When faced with massive quantities of data, traditional d...
Aleksander Kolcz, Abdur Chowdhury, Joshua Alspecto...
FCCM
2009
IEEE
134views VLSI» more  FCCM 2009»
15 years 7 months ago
Efficient Mapping of Hardware Tasks on Reconfigurable Computers Using Libraries of Architecture Variants
Scheduling and partitioning of task graphs on reconfigurable hardware needs to be carefully carried out in order to achieve the best possible performance. In this paper, we demons...
Miaoqing Huang, Vikram K. Narayana, Tarek A. El-Gh...
160
Voted
ICDM
2008
IEEE
186views Data Mining» more  ICDM 2008»
15 years 10 months ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
140
Voted
ICDM
2009
IEEE
145views Data Mining» more  ICDM 2009»
15 years 1 months ago
Significance of Episodes Based on Minimal Windows
Discovering episodes, frequent sets of events from a sequence has been an active field in pattern mining. Traditionally, a level-wise approach is used to discover all frequent epis...
Nikolaj Tatti
106
Voted
ICDM
2003
IEEE
109views Data Mining» more  ICDM 2003»
15 years 8 months ago
Comparing Pure Parallel Ensemble Creation Techniques Against Bagging
We experimentally evaluate randomization-based approaches to creating an ensemble of decision-tree classifiers. Unlike methods related to boosting, all of the eight approaches co...
Lawrence O. Hall, Kevin W. Bowyer, Robert E. Banfi...