Search Sciweavers | Sciweavers

1236 search results - page 178 / 248

» Opposition-Based Reinforcement Learning

138

Voted

SMC
2007
IEEE

102views Control Systems» more SMC 2007»

An improved immune Q-learning algorithm

15 years 10 months ago

Download web2.uwindsor.ca

—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...

Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...

claim paper

Read More »

150

click to vote

IROS
2006
IEEE

113views Robotics» more IROS 2006»

Policy Gradient Methods for Robotics

15 years 10 months ago

Download www.cs.utah.edu

— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...

Jan Peters, Stefan Schaal

claim paper

Read More »

155

Voted

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

15 years 9 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

121

click to vote

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

15 years 6 months ago

Download www.cs.washington.edu

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

141

click to vote

AAAI
2006

127views Intelligent Agents» more AAAI 2006»

Modeling Human Decision Making in Cliff-Edge Environments

15 years 5 months ago

Download www.aaai.org

In this paper we propose a model for human learning and decision making in environments of repeated Cliff-Edge (CE) interactions. In CE environments, which include common daily in...

Ron Katz, Sarit Kraus

claim paper

Read More »

« Prev « First page 178 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers