Search Sciweavers | Sciweavers

664 search results - page 40 / 133

» Combining Reinforcement Learning with a Local Control Algori...

173

Voted

ICRA
2008
IEEE

134views Robotics» more ICRA 2008»

Real-time learning of resolved velocity control on a Mitsubishi PA-10

16 years 22 days ago

Download www-clmc.usc.edu

Abstract— Learning inverse kinematics has long been fascinating the robot learning community. While humans acquire this transformation to complicated tool spaces with ease, it is...

Jan Peters, Duy Nguyen-Tuong

claim paper

Read More »

177

Voted

HIS
2008

122views Information Technology» more HIS 2008»

New Crossover Operator for Evolutionary Rule Discovery in XCS

15 years 7 months ago

Download www.salle.url.edu

XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve rule sets on-line by means of the interaction with an envi...

Sergio Morales-Ortigosa, Albert Orriols-Puig, Este...

claim paper

Read More »

165

click to vote

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

15 years 4 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

158

Voted

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 7 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

171

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 7 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

« Prev « First page 40 / 133 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers