Sciweavers

» Data Mining: Machine Learning, Statistics, and Databases
OSDI
2008
ACM
15 years 2 months ago
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
DryadLINQ is a system and a set of language extensions that enable a new programming model for large scale distributed computing. It generalizes previous execution environments su...
Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Bud...
ICML
2008
IEEE
16 years 18 days ago
Sequence kernels for predicting protein essentiality
The problem of identifying the minimal gene set required to sustain life is of crucial importance in understanding cellular mechanisms and designing therapeutic drugs. This work d...
Cyril Allauzen, Mehryar Mohri, Ameet Talwalkar
ILP
2007
Springer
15 years 6 months ago
Bias/Variance Analysis for Relational Domains
Bias/variance analysis is a useful tool for investigating the performance of machine learning algorithms. Conventional analysis decomposes loss into errors due to aspects of the le...
Jennifer Neville, David Jensen
CIKM
2010
Springer
14 years 10 months ago
Visual cube and on-line analytical processing of images
On-Line Analytical Processing (OLAP) has shown great success in many industry applications, including sales, marketing, management, financial data analysis, etc. In this paper, w...
Xin Jin, Jiawei Han, Liangliang Cao, Jiebo Luo, Bo...
CIKM
2000
Springer
15 years 4 months ago
Scalable association-based text classification
Naïve Bayes (NB) classifier has long been considered a core methodology in text classification mainly due to its simplicity and computational efficiency. There is an increasing n...
Dimitris Meretakis, Dimitris Fragoudis, Hongjun Lu...