Search Sciweavers | Sciweavers

651 search results - page 76 / 131

» Algorithms for Inverse Reinforcement Learning

224

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

15 years 2 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

169

click to vote

ICML
2003
IEEE

146views Machine Learning» more ICML 2003»

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 8 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

claim paper

Read More »

195

click to vote

ATAL
2010
Springer

146views Intelligent Agents» more ATAL 2010»

PAC-MDP learning with knowledge-based admissible models

15 years 7 months ago

Download www.aamas-conference.org

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko

claim paper

Read More »

235

Voted

ICAC
2009
IEEE

226views Applied Computing» more ICAC 2009»

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

15 years 5 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

211

click to vote

IJHIS
2006

94views more IJHIS 2006»

A new fine-grained evolutionary algorithm based on cellular learning automata

15 years 7 months ago

Download ceit.aut.ac.ir

In this paper, a new evolutionary computing model, called CLA-EC, is proposed. This model is a combination of a model called cellular learning automata (CLA) and the evolutionary ...

Reza Rastegar, Mohammad Reza Meybodi, Arash Hariri

claim paper

Read More »

« Prev « First page 76 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers