Search Sciweavers | Sciweavers

458 search results - page 2 / 92

» Q-Decomposition for Reinforcement Learning Agents

click to vote

AAAI
2010

161views Intelligent Agents» more AAAI 2010»

Learning Methods to Generate Good Plans: Integrating HTN Learning and Reinforcement Learning

13 years 6 months ago

Download www.cse.lehigh.edu

Chad Hogg, Ugur Kuter, Hector Muñoz-Avila

claim paper

Read More »

click to vote

AAMAS
2002
Springer

130views Intelligent Agents» more AAMAS 2002»

Relational Reinforcement Learning for Agents in Worlds with Objects

13 years 4 months ago

Download www-ai.ijs.si

In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it ...

Saso Dzeroski

claim paper

Read More »

click to vote

ICML
1998
IEEE

202views Machine Learning» more ICML 1998»

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

13 years 9 months ago

Download www.cs.mcgill.ca

We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa( )-algorithm. Then we solve the ...

Jette Randløv, Preben Alstrøm

claim paper

Read More »

click to vote

ATAL
2006
Springer

103views Intelligent Agents» more ATAL 2006»

Rule value reinforcement learning for cognitive agents

13 years 8 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis

claim paper

Read More »

click to vote

CORR
2011
Springer

136views Education» more CORR 2011»

Reinforcement Learning for Agents with Many Sensors and Actuators Acting in Categorizable Environments

12 years 8 months ago

Download www.aaai.org

In this paper, we confront the problem of applying reinforcement learning to agents that perceive the environment through many sensors and that can perform parallel actions using ...

Enric Celaya, Josep M. Porta

claim paper

Read More »

« Prev « First page 2 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers