Sciweavers

1234 search results - page 20 / 247
» Multi-criteria Reinforcement Learning
Sort
View
ICMLA
2004
15 years 1 months ago
Variable resolution discretization in the joint space
We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...
Christopher K. Monson, David Wingate, Kevin D. Sep...
ICMLA
2010
14 years 9 months ago
Multimodal Parameter-exploring Policy Gradients
Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...
IEAAIE
2001
Springer
15 years 4 months ago
On the Relationship between Learning Capability and the Boltzmann-Formula
In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...
Péter Stefán, Laszlo Monostori
ECML
2005
Springer
15 years 5 months ago
Towards Finite-Sample Convergence of Direct Reinforcement Learning
Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...
Shiau Hong Lim, Gerald DeJong
ATAL
2006
Springer
15 years 3 months ago
Rule value reinforcement learning for cognitive agents
RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...
Christopher Child, Kostas Stathis