Sampling is a widely used technique to increase efficiency in database and data mining applications operating on large datasets. In this paper, we present a scalable sampling imple...
EGEE, along with its sister project LCG, manages the world’s largest Grid production infrastructure, which now spans over 260 sites in more than 40 countries. Just...
As multi-core and many-core processors become prevalent, the choice of programming language becomes important in constructing efficient parallel applications. In this work, we build a multithreaded video min...
Wenlong Li, Eric Li, Ran Meng, Tao Wang, Carole Du...
A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different r...
Choon Hui Teo, Alex J. Smola, S. V. N. Vishwanatha...
Today’s autonomous and heterogeneous information systems increasingly need to exchange data. As is widely known, middleware systems have been successfully adopted as i...