Sciweavers

473 search results - page 60 / 95
» Optimal policy switching algorithms for reinforcement learni...
Sort
View
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
14 years 9 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone
CEC
2011
IEEE
13 years 11 months ago
On universal search strategies for multi-criteria optimization using weighted sums
—We develop a stochastic local search algorithm for finding Pareto points for multi-criteria optimization problems. The algorithm alternates between different single-criterium o...
Julien Legriel, Scott Cotton, Oded Maler
GECCO
2005
Springer
111views Optimization» more  GECCO 2005»
15 years 5 months ago
XCS with eligibility traces
The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...
Jan Drugowitsch, Alwyn Barry
IJCNN
2006
IEEE
15 years 5 months ago
Learning a Rendezvous Task with Dynamic Joint Action Perception
Abstract— Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...
Nancy Fulda, Dan Ventura
ICML
1994
IEEE
15 years 3 months ago
Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...