reinforcement learning

ATAL
2006
Springer

135 views, Intelligent Agents

Learning the required number of agents for complex tasks

13 years 8 months ago

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa

ATAL
2006
Springer

192 views, Intelligent Agents

A hierarchical approach to efficient reinforcement learning in deterministic domains

13 years 8 months ago

Download paul.rutgers.edu

Factored representations, model-based learning, and hierarchies are well-studied techniques for improving the learning efficiency of reinforcement-learning algorithms in large-sca...

Carlos Diuk, Alexander L. Strehl, Michael L. Littm...

ATAL
2006
Springer

103 views, Intelligent Agents

Rule value reinforcement learning for cognitive agents

13 years 8 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis

ICANN
2009
Springer

123 views, Neural Networks

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

13 years 8 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting, details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

ICML
1996
IEEE

196 views, Machine Learning

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

13 years 9 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

ICML
1998
IEEE

202 views, Machine Learning

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

13 years 9 months ago

Download www.cs.mcgill.ca

We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa(λ)-algorithm. Then we solve the ...

Jette Randløv, Preben Alstrøm

IWANN
1999
Springer

115 views, Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

13 years 9 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

IEAAIE
2001
Springer

98 views, Artificial Intelligence

On the Relationship between Learning Capability and the Boltzmann-Formula

13 years 9 months ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori

AI
2001
Springer

118 views, Artificial Intelligence

Imitation and Reinforcement Learning in Agents with Heterogeneous Actions

13 years 9 months ago

Download www.cs.toronto.edu

Reinforcement learning techniques are increasingly being used to solve difficult problems in control and combinatorial optimization with promising results. Implicit imitation can acc...

Bob Price, Craig Boutilier

GECCO
2003
Springer

79 views, Optimization

Reinforcement Learning Estimation of Distribution Algorithm

13 years 10 months ago

Download www.iba.t.u-tokyo.ac.jp

This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions ...

Topon Kumar Paul, Hitoshi Iba
