Sciweavers

1800 search results - page 120 / 360
» Learning Restart Strategies
Sort
View
ICML
2009
IEEE
16 years 5 months ago
Online feature elicitation in interactive optimization
Most models of utility elicitation in decision support and interactive optimization assume a predefined set of "catalog" features over which user preferences are express...
Craig Boutilier, Kevin Regan, Paolo Viappiani
ICML
2007
IEEE
16 years 5 months ago
Percentile optimization in uncertain Markov decision processes with application to efficient exploration
Markov decision processes are an effective tool in modeling decision-making in uncertain dynamic environments. Since the parameters of these models are typically estimated from da...
Erick Delage, Shie Mannor
ECML
2005
Springer
15 years 10 months ago
Multi-armed Bandit Algorithms and Empirical Evaluation
The multi-armed bandit problem for a gambler is to decide which arm of a K-slot machine to pull to maximize his total reward in a series of trials. Many real-world learning and opt...
Joannès Vermorel, Mehryar Mohri
ICML
2009
IEEE
15 years 11 months ago
Feature hashing for large scale multitask learning
Empirical evidence suggests that hashing is an effective strategy for dimensionality reduction and practical nonparametric estimation. In this paper we provide exponential tail bo...
Kilian Q. Weinberger, Anirban Dasgupta, John Langf...
ICPR
2008
IEEE
15 years 11 months ago
A novel robust kernel for appearance-based learning
Robustness is one of the most critical issues in the appearance-based learning strategies. In this work, we propose a novel kernel that is robust against data corruption for vario...
Chia-Te Liao, Shang-Hong Lai