Search Sciweavers | Sciweavers

50 search results - page 10 / 10

» Reinforcement Learning for Penalty Avoidance in Continuous S...

ICML
2001
IEEE

159 views, Machine Learning

Direct Policy Search using Paired Statistical Tests

14 years 6 months ago

Download www.autonlab.org

Direct policy search is a practical way to solve reinforcement learning problems involving continuous state and action spaces. The goal becomes finding policy parameters that maxi...

Malcolm J. A. Strens, Andrew W. Moore

AAMAS
2007
Springer

164 views, Intelligent Agents

Networks of Learning Automata and Limiting Games

13 years 11 months ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that...

Peter Vrancx, Katja Verbeeck, Ann Nowé

MIRRORBOT
2005
Springer

154 views, Robotics

Spatial Representation and Navigation in a Bio-inspired Robot

13 years 10 months ago

Download icwww.epfl.ch

Abstract. A biologically inspired computational model of rodent representation–based (locale) navigation is presented. The model combines visual input in the form of realistic tw...

Denis Sheynikhovich, Ricardo Chavarriaga, Thomas S...

AAAI
2006

118 views, Intelligent Agents

Hard Constrained Semi-Markov Decision Processes

13 years 6 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

ICML
2010
IEEE

247 views, Machine Learning

Inverse Optimal Control with Linearly-Solvable MDPs

13 years 6 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearly-solvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov
