Sciweavers

50 search results - page 10 / 10
» Reinforcement Learning for Penalty Avoidance in Continuous S...
Sort
View
ICML
2001
IEEE
14 years 6 months ago
Direct Policy Search using Paired Statistical Tests
Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...
Malcolm J. A. Strens, Andrew W. Moore
AAMAS
2007
Springer
13 years 11 months ago
Networks of Learning Automata and Limiting Games
Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that...
Peter Vrancx, Katja Verbeeck, Ann Nowé
MIRRORBOT
2005
Springer
154views Robotics» more  MIRRORBOT 2005»
13 years 10 months ago
Spatial Representation and Navigation in a Bio-inspired Robot
Abstract. A biologically inspired computational model of rodent representation–based (locale) navigation is presented. The model combines visual input in the form of realistic tw...
Denis Sheynikhovich, Ricardo Chavarriaga, Thomas S...
AAAI
2006
13 years 6 months ago
Hard Constrained Semi-Markov Decision Processes
In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...
Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong
ICML
2010
IEEE
13 years 6 months ago
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov