Distributed and Parallel Data Mining on the Grid

11 years 5 months ago
Distributed and Parallel Data Mining on the Grid
Abstract: This paper presents the initial design and implementation of a Gridbased distributed and parallel data mining system. The Grid system, namely the Business Intelligence Grid or BIGrid, is based on heterogeneous Grid server configurations and service-oriented Grid architecture. The system follows a layered design, whose infrastructure is divided into three tiers in general - Grid tier, a service tier and a client/portal tier. Issues of design and implementation, including brokering, task scheduling, adaptive mining script preparation and parallelization are discussed. The design and implementation of BIGrid help identify the specific requirements of applying Grid-based data mining in business realm, thus pave way for future design and implementation of a real generic Gridbased data mining system.
Tianchao Li, Toni Bollinger
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where ARCS
Authors Tianchao Li, Toni Bollinger
Comments (0)