Sciweavers

41 search results - page 6 / 9
» Large Scale Parallel Document Mining for Machine Translation
GRID
2008
Springer
14 years 10 months ago
Troubleshooting thousands of jobs on production grids using data mining techniques
Large scale production computing grids introduce new challenges in debugging and troubleshooting. A user that submits a workload consisting of tens of thousands of jobs to a grid ...
David A. Cieslak, Nitesh V. Chawla, Douglas Thain
ICWSM
2008
14 years 11 months ago
International Sentiment Analysis for News and Blogs
There is a growing interest in mining opinions using sentiment analysis methods from sources such as news, blogs and product reviews. Most of these methods have been developed for...
Mikhail Bautin, Lohit Vijayarenu, Steven Skiena
WSDM
2009
ACM
15 years 4 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
KDD
2007
ACM
15 years 10 months ago
A scalable modular convex solver for regularized risk minimization
A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different r...
Choon Hui Teo, Alex J. Smola, S. V. N. Vishwanatha...
ICDE
2004
IEEE
15 years 11 months ago
Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks
We study the problem of maintaining large replicated collections of files or documents in a distributed environment with limited bandwidth. This problem arises in a number of impo...
Torsten Suel, Patrick Noel, Dimitre Trendafilov