Sciweavers

676 search results - page 81 / 136
» Distributed and Parallel Data Mining on the Grid
Sort
View
CCGRID
2008
IEEE
15 years 10 months ago
Scalable Data Gathering for Real-Time Monitoring Systems on Distributed Computing
Real-time monitoring is increasingly becoming important in various scenes of large scale, multi-site distributed/parallel computing, e.g, understanding behavior of systems, schedu...
Yoshikazu Kamoshida, Kenjiro Taura
163
Voted
GRID
2006
Springer
15 years 4 months ago
Stream processing in data-driven computational science
The use of real-time data streams in data-driven computational science is driving the need for stream processing tools that work within the architectural framework of the larger ap...
Ying Liu, Nithya N. Vijayakumar, Beth Plale
APPT
2005
Springer
15 years 9 months ago
Principal Component Analysis for Distributed Data Sets with Updating
Identifying the patterns of large data sets is a key requirement in data mining. A powerful technique for this purpose is the principal component analysis (PCA). PCA-based clusteri...
Zheng-Jian Bai, Raymond H. Chan, Franklin T. Luk
HPDC
2003
IEEE
15 years 9 months ago
Optimizing GridFTP through Dynamic Right-Sizing
In this paper, we describe the integration of dynamic right-sizing — an automatic and scalable buffer management technique for enhancing TCP performance — into GridFTP, a sub...
Sunil Thulasidasan, Wu-chun Feng, Mark K. Gardner
ICDM
2010
IEEE
189views Data Mining» more  ICDM 2010»
15 years 1 months ago
S4: Distributed Stream Computing Platform
Abstract--S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop applications for processing continu...
Leonardo Neumeyer, Bruce Robbins, Anish Nair, Anan...