Sciweavers

1672 search results - page 287 / 335
» Optimizing Monitoring Queries over Distributed Data
Sort
View
90
Voted
KRDB
1998
93views Database» more  KRDB 1998»
14 years 11 months ago
Intelligent Caching for Information Mediators: A KR Based Approach
We present a semantic caching approach for optimizing the performance of information mediators. A critical problem with information mediators, particularly those gathering and int...
Naveen Ashish, Craig A. Knoblock, Cyrus Shahabi
KDD
2012
ACM
194views Data Mining» more  KDD 2012»
13 years 19 days ago
A sparsity-inducing formulation for evolutionary co-clustering
Traditional co-clustering methods identify block structures from static data matrices. However, the data matrices in many applications are dynamic; that is, they evolve smoothly o...
Shuiwang Ji, Wenlu Zhang, Jun Liu
93
Voted
CLUSTER
2001
IEEE
15 years 1 months ago
Clusterfile: A Flexible Physical Layout Parallel File System
This paper presents Clusterfile, a parallel file system that provides parallel file access on a cluster of computers. Existing parallel file systems offer little control over matc...
Florin Isaila, Walter F. Tichy
183
Voted
PAKDD
2011
ACM
473views Data Mining» more  PAKDD 2011»
14 years 3 months ago
 Finding Rare Classes: Adapting Generative and Discriminative Models in Active Learning
Discovering rare categories and classifying new instances of them is an important data mining issue in many fields, but fully supervised learning of a rare class classifier is pr...
Timothy Hospedales, Shaogang Gong and Tao Xiang
97
Voted
KDD
2004
ACM
210views Data Mining» more  KDD 2004»
15 years 10 months ago
Probabilistic author-topic models for information discovery
We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...