Sciweavers

3084 search results - page 125 / 617
» Learning to Take Actions
Sort
View
AAAI
1997
14 years 11 months ago
Reinforcement Learning with Time
This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...
Daishi Harada
NIPS
2003
14 years 11 months ago
All learning is Local: Multi-agent Learning in Global Reward Games
In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms. We provide a simple and efï¬...
Yu-Han Chang, Tracey Ho, Leslie Pack Kaelbling
ECML
2007
Springer
15 years 4 months ago
Safe Q-Learning on Complete History Spaces
In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...
Stephan Timmer, Martin Riedmiller
RA
2003
135views Robotics» more  RA 2003»
14 years 11 months ago
Behavioural Cloning and Robot Control
Behavioural cloning is a method by which a machine learns control skills through observing what a human controller would do in a certain set of circumstances. More specifically, t...
Claire D'Este, Mark O'Sullivan, Nicholas Hannah
NN
2006
Springer
14 years 10 months ago
The misbehavior of value and the discipline of the will
Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...
Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...