Sciweavers

» Clustering performance data efficiently at massive scales
SIGMOD
2012
ACM
13 years 3 months ago
Exploiting MapReduce-based similarity joins
Cloud enabled systems have become a crucial component to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Sim...
Yasin N. Silva, Jason M. Reed
CCGRID
2008
IEEE
15 years 2 months ago
A Probabilistic Model to Analyse Workflow Performance on Production Grids
Production grids are complex and highly variable systems whose behavior is not well understood and difficult to anticipate. The goal of this study is to estimate the impact of the ...
Tristan Glatard, Johan Montagnat, Xavier Pennec
BMCBI
2008
15 years 26 days ago
EST2uni: an open, parallel tool for automated EST analysis and database creation, with a data mining web interface and microarra
Background: Expressed sequence tag (EST) collections are composed of a high number of single-pass, redundant, partial sequences, which need to be processed, clustered, and annotat...
Javier Forment, Francisco Gilabert Villamón...
VLDB
2000
ACM
15 years 4 months ago
Efficient Filtering of XML Documents for Selective Dissemination of Information
Information Dissemination applications are gaining increasing popularity due to dramatic improvements in communications bandwidth and ubiquity. The sheer volume of data available ...
Mehmet Altinel, Michael J. Franklin
INFOCOM
2002
IEEE
15 years 5 months ago
Using the Small-World Model to Improve Freenet Performance
Efficient data retrieval in a peer-to-peer system like Freenet is a challenging problem. In this paper we study the impact of cache replacement policy on the performance of Fre...
Hui Zhang 0002, Ashish Goel, Ramesh Govindan