Search Sciweavers | Sciweavers

161 search results - page 19 / 33

WSC
2008

154 views, Modeling And Simulation

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 2 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

ATAL
2006
Springer

135 views, Intelligent Agents

Learning the required number of agents for complex tasks

15 years 3 months ago

Download www.damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa

AI
2001
Springer

118 views, Artificial Intelligence

Imitation and Reinforcement Learning in Agents with Heterogeneous Actions

15 years 4 months ago

Download www.cs.toronto.edu

Reinforcement learning techniques are increasingly being used to solve difficult problems in control and combinatorial optimization with promising results. Implicit imitation can acc...

Bob Price, Craig Boutilier

ICMLA
2010

203 views, Machine Learning

Multimodal Parameter-exploring Policy Gradients

14 years 9 months ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

JAIR
2002

163 views

Efficient Reinforcement Learning Using Recursive Least-Squares Methods

14 years 11 months ago

Download www.jair.org

The recursive least-squares (RLS) algorithm is one of the most well-known algorithms used in adaptive filtering, system identification and adaptive control. Its popularity is main...

Xin Xu, Hangen He, Dewen Hu
