Search Sciweavers | Sciweavers

181 search results - page 1 / 37

AAAI
2007

71 views, Intelligent Agents

On Policy Learning in Restricted Policy Spaces

15 years 1 month ago

Robby Goetschalckx, Jan Ramon

ICML
2009
IEEE

194 views, Machine Learning

Binary action search for learning continuous-action control policies

16 years 6 days ago

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

ICML
2000
IEEE

153 views, Machine Learning

Eligibility Traces for Off-Policy Policy Evaluation

16 years 6 days ago

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

RAS
2010

131 views

Probabilistic Policy Reuse for inter-task transfer learning

14 years 9 months ago

Policy Reuse is a reinforcement learning technique that efficiently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

ML
2008
ACM

152 views, Machine Learning

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

14 years 11 months ago

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, Ré...
