Search Sciweavers | Sciweavers

99 search results - page 4 / 20

» Action Selection in Bayesian Reinforcement Learning

IEAAIE
2001
Springer

98 views, Artificial Intelligence

On the Relationship between Learning Capability and the Boltzmann-Formula

13 years 10 months ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori

IAT
2003
IEEE

171 views, Intelligent Agents

Asymmetric Multiagent Reinforcement Learning

13 years 11 months ago

Download lib.tkk.fi

A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...

Ville Könönen

AAMAS
2002
Springer

130 views, Intelligent Agents

Relational Reinforcement Learning for Agents in Worlds with Objects

13 years 5 months ago

Download www-ai.ijs.si

In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it ...

Saso Dzeroski

IJCAI
2007

179 views, Artificial Intelligence

Deictic Option Schemas

13 years 7 months ago

Download www.ijcai.org

Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this...

Balaraman Ravindran, Andrew G. Barto, Vimal Mathew

ACL
2009

123 views, Computational Linguistics

Reinforcement Learning for Mapping Instructions to Actions

13 years 3 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...
