Sciweavers

1390 search results - page 173 / 278
» Self-Sizing of Clustered Databases
EDBT
2012
ACM
306 views, Database
13 years 4 months ago
Clydesdale: structured data processing on MapReduce
MapReduce has emerged as a promising architecture for large scale data analytics on commodity clusters. The rapid adoption of Hive, a SQL-like data processing language on Hadoop (...
Tim Kaldewey, Eugene J. Shekita, Sandeep Tata
DEXAW
1999
IEEE
97 views, Database
15 years 5 months ago
Mining Several Data Bases with an Ensemble of Classifiers
The results of knowledge discovery in databases could vary depending on the data mining method. There are several ways to select the most appropriate data mining method dynamicall...
Seppo Puuronen, Vagan Y. Terziyan, Alexander Logvi...
VLDB
2005
ACM
118 views, Database
15 years 7 months ago
Selectivity Estimation for Fuzzy String Predicates in Large Data Sets
Many database applications have the emerging need to support fuzzy queries that ask for strings that are similar to a given string, such as “name similar to smith” and “tele...
Liang Jin, Chen Li
KDD
2006
ACM
156 views, Data Mining
16 years 2 months ago
Discovering significant OPSM subspace clusters in massive gene expression data
Order-preserving submatrixes (OPSMs) have been accepted as a biologically meaningful subspace cluster model, capturing the general tendency of gene expressions across a subset of ...
Byron J. Gao, Obi L. Griffith, Martin Ester, Steve...
CVPR
2007
IEEE
16 years 3 months ago
Discriminative Cluster Refinement: Improving Object Category Recognition Given Limited Training Data
A popular approach to problems in image classification is to represent the image as a bag of visual words and then employ a classifier to categorize the image. Unfortunately, a si...
Liu Yang, Rong Jin, Caroline Pantofaru, Rahul Sukt...