Search Sciweavers | Sciweavers

51 search results - page 1 / 11

» Characterizing reinforcement learning methods through parame...

241

Voted

ML
2011
ACM

152views Machine Learning» more ML 2011»

Characterizing reinforcement learning methods through parameterized learning problems

14 years 6 months ago

Download www.cs.utexas.edu

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

106

Voted

AI
2006
Springer

103views Artificial Intelligence» more AI 2006»

Trace Equivalence Characterization Through Reinforcement Learning

15 years 7 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...

claim paper

Read More »

134

Voted

IJCNN
2006
IEEE

127views Neural Networks» more IJCNN 2006»

Reinforcement Learning for Parameterized Motor Primitives

15 years 9 months ago

Download www-clmc.usc.edu

Abstract— One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the “building blocks of movement genera...

Jan Peters, Stefan Schaal

claim paper

Read More »

147

Voted

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 5 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

136

click to vote

ICML
2004
IEEE

156views Machine Learning» more ICML 2004»

Learning to fly by combining reinforcement learning with behavioural cloning

16 years 4 months ago

Download ccc.inaoep.mx

Reinforcement learning deals with learning optimal or near optimal policies while interacting with the environment. Application domains with many continuous variables are difficul...

Eduardo F. Morales, Claude Sammut

claim paper

Read More »

« Prev « First page 1 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers