Sciweavers

340 search results - page 62 / 68
» Kernelized value function approximation for reinforcement le...
ATAL
2004
Springer
15 years 2 months ago
Unifying Temporal and Structural Credit Assignment Problems
Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...
Adrian K. Agogino, Kagan Tumer
ML
2008
ACM
14 years 9 months ago
Feature selection via sensitivity analysis of SVM probabilistic outputs
Feature selection is an important aspect of solving data-mining and machine-learning problems. This paper proposes a feature-selection method for the Support Vector Machine (SVM) l...
Kai Quan Shen, Chong Jin Ong, Xiao Ping Li, Einar ...
SAGA
2009
Springer
15 years 4 months ago
Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies
We propose a generic method for obtaining quickly good upper bounds on the minimal value of a multistage stochastic program. The method is based on the simulation of a feasible dec...
Boris Defourny, Damien Ernst, Louis Wehenkel
SCALESPACE
2005
Springer
15 years 3 months ago
Vortex and Source Particles for Fluid Motion Estimation
In this paper we propose a new motion estimator for image sequences depicting fluid flows. The proposed estimator is based on the Helmholtz decomposition of vector fields. This ...
Anne Cuzol, Étienne Mémin
ICML
2006
IEEE
15 years 10 months ago
Fast direct policy evaluation using multiscale analysis of Markov diffusion processes
Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...
Mauro Maggioni, Sridhar Mahadevan