Sciweavers

SIGMOD
2010
ACM
260views Database» more  SIGMOD 2010»
13 years 9 months ago
Towards proximity pattern mining in large graphs
Mining graph patterns in large networks is critical to a variety of applications such as malware detection and biological module discovery. However, frequent subgraphs are often i...
Arijit Khan, Xifeng Yan, Kun-Lung Wu
SIGMOD
2010
ACM
364views Database» more  SIGMOD 2010»
13 years 9 months ago
Optimizing schema-last tuple-store queries in graphd
Current relational databases require that a database schema exist prior to data entry and require manual optimization for best performance. We describe the query optimization tech...
Scott M. Meyer, Jutta Degener, John Giannandrea, B...
SIGMOD
2010
ACM
312views Database» more  SIGMOD 2010»
13 years 9 months ago
TwitterMonitor: trend detection over the twitter stream
We present TwitterMonitor, a system that performs trend detection over the Twitter stream. The system identifies emerging topics (i.e. ‘trends’) on Twitter in real time and p...
Michael Mathioudakis, Nick Koudas
SIGMOD
2010
ACM
321views Database» more  SIGMOD 2010»
13 years 9 months ago
HadoopDB in action: building real world applications
HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work ha...
Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Hua...
SIGMOD
2010
ACM
208views Database» more  SIGMOD 2010»
13 years 9 months ago
Efficient parallel set-similarity joins using MapReduce
Rares Vernica, Michael J. Carey, Chen Li
SIGMOD
2010
ACM
377views Database» more  SIGMOD 2010»
13 years 9 months ago
Online aggregation and continuous query support in MapReduce
MapReduce is a popular framework for data-intensive distributed computing of batch jobs. To simplify fault tolerance, the output of each MapReduce task and job is materialized to ...
Tyson Condie, Neil Conway, Peter Alvaro, Joseph M....
SIGMOD
2010
ACM
236views Database» more  SIGMOD 2010»
13 years 9 months ago
Differentially private aggregation of distributed time-series with transformation and encryption
We propose PASTE, the first differentially private aggregation algorithms for distributed time-series data that offer good practical utility without any trusted server. PASTE add...
Vibhor Rastogi, Suman Nath
SIGMOD
2010
ACM
231views Database» more  SIGMOD 2010»
13 years 9 months ago
Automatically incorporating new sources in keyword search-based data integration
Scientific data offers some of the most interesting challenges in data integration today. Scientific fields evolve rapidly and accumulate masses of observational and experiment...
Partha Pratim Talukdar, Zachary G. Ives, Fernando ...
SIGMOD
2010
ACM
153views Database» more  SIGMOD 2010»
13 years 9 months ago
Threshold query optimization for uncertain data
The probabilistic threshold query (PTQ) is one of the most common queries in uncertain databases, where all results satisfying the query with probabilities that meet the threshold...
Yinian Qi, Rohit Jain, Sarvjeet Singh, Sunil Prabh...
SIGMOD
2010
ACM
197views Database» more  SIGMOD 2010»
13 years 9 months ago
TACO: tunable approximate computation of outliers in wireless sensor networks
Nikos Giatrakos, Yannis Kotidis, Antonios Deligian...