Search Sciweavers | Sciweavers

1234 search results - page 20 / 247

» Multi-criteria Reinforcement Learning

click to vote

ICMLA
2004

109views Machine Learning» more ICMLA 2004»

Variable resolution discretization in the joint space

15 years 1 months ago

Download highentropy.com

We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...

Christopher K. Monson, David Wingate, Kevin D. Sep...

claim paper

Read More »

102

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

14 years 9 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

click to vote

IEAAIE
2001
Springer

98views Artificial Intelligence» more IEAAIE 2001»

On the Relationship between Learning Capability and the Boltzmann-Formula

15 years 4 months ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori

claim paper

Read More »

click to vote

ECML
2005
Springer

95views Machine Learning» more ECML 2005»

Towards Finite-Sample Convergence of Direct Reinforcement Learning

15 years 5 months ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong

claim paper

Read More »

click to vote

ATAL
2006
Springer

103views Intelligent Agents» more ATAL 2006»

Rule value reinforcement learning for cognitive agents

15 years 3 months ago

Download vega.soi.city.ac.uk

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule...

Christopher Child, Kostas Stathis

claim paper

Read More »

« Prev « First page 20 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers