Search Sciweavers | Sciweavers

54 search results - page 2 / 11

» Convergence Results for Single-Step On-Policy Reinforcement-...

ICAI
2004

116 views, Artificial Intelligence

Action Inhibition

15 years 7 months ago

Download mysite.verizon.net

An explicit exploration strategy is necessary in reinforcement learning (RL) to balance the need to reduce the uncertainty associated with the expected outcome of an action and the...

Myriam Abramson

NIPS
1998

140 views, Information Technology

Gradient Descent for General Reinforcement Learning

15 years 7 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcement-learning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

ICML
1995
IEEE

184 views, Machine Learning

Residual Algorithms: Reinforcement Learning with Function Approximation

16 years 6 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

ICML
2007
IEEE

141 views, Machine Learning

Reinforcement learning by reward-weighted regression for operational space control

16 years 6 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

ATAL
2008
Springer

99 views, Intelligent Agents

Non-linear dynamics in multiagent reinforcement learning algorithms

15 years 8 months ago

Download www.aamas-conference.org

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...

Sherief Abdallah, Victor R. Lesser
