Sciweavers

187 search results - page 21 / 38
» Imitation and Reinforcement Learning in Agents with Heteroge...
Sort
View
ICANN
2001
Springer
15 years 1 months ago
Market-Based Reinforcement Learning in Partially Observable Worlds
Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...
Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber
ATAL
2009
Springer
15 years 4 months ago
Solving multiagent assignment Markov decision processes
We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called“Assignmen...
Scott Proper, Prasad Tadepalli
GECCO
2009
Springer
200views Optimization» more  GECCO 2009»
15 years 4 months ago
Apply ant colony optimization to Tetris
Tetris is a falling block game where the player’s objective is to arrange a sequence of different shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...
Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...
PRIMA
2009
Springer
15 years 4 months ago
Recursive Adaptation of Stepsize Parameter for Non-stationary Environments
In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...
Itsuki Noda
ISADS
1999
IEEE
15 years 1 months ago
Emergence of Communication for Negotiation by a Recurrent Neural Network
We believe that communication in multi-agent system has two major meanings. One of them is to transmit one agent's observed information to the other. The other meaning is to ...
Katsunari Shibata, Koji Ito