Search Sciweavers | Sciweavers

29 search results - page 3 / 6

JCP
2008

139 views

Agent Learning in Relational Domains based on Logical MDPs with Negation

13 years 6 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

NN
2002
Springer

113 views, Neural Networks

Control of exploitation-exploration meta-parameter in reinforcement learning

13 years 5 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

AROBOTS
1999

104 views

Reinforcement Learning Soccer Teams with Incomplete World Models

13 years 5 months ago

Download igitur-archive.library.uu.nl

We use reinforcement learning (RL) to compute strategies for multiagent soccer teams. RL may profit significantly from world models (WMs) estimating state transition probabilities an...

Marco Wiering, Rafal Salustowicz, Jürgen Schm...

ECML
2007
Springer

192 views, Machine Learning

Policy Gradient Critics

14 years 9 days ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

TSMC
2008

132 views

Ensemble Algorithms in Reinforcement Learning

13 years 6 months ago

Download people.cs.uu.nl

This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...

Marco A. Wiering, Hado van Hasselt
