Sciweavers

8 search results - page 1 / 2
ATAL
2008
Springer
13 years 6 months ago
Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies
Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...
Thomas Gabel, Martin A. Riedmiller
NECO
2007
13 years 4 months ago
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
JAIR
2008
13 years 4 months ago
Learning Partially Observable Deterministic Action Models
We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...
Eyal Amir, Allen Chang
ICANN
2001
Springer
13 years 9 months ago
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber
ICRA
1995
IEEE
13 years 8 months ago
Vision-Based Reinforcement Learning for Purposive Behavior Acquisition
This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement...
Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, ...