Search Sciweavers | Sciweavers

1236 search results - page 213 / 248

» Opposition-Based Reinforcement Learning

144

click to vote

ICRA
2007
IEEE

155views Robotics» more ICRA 2007»

Value Function Approximation on Non-Linear Manifolds for Robot Motor Control

15 years 10 months ago

Download sugiyama-www.cs.titech.ac.jp

— The least squares approach works efﬁciently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...

Masashi Sugiyama, Hirotaka Hachiya, Christopher To...

claim paper

Read More »

109

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

134

click to vote

CIMCA
2006
IEEE

164views Intelligent Agents» more CIMCA 2006»

Multi-Agent Coalition Formation for Long-Term Task or Mobile Network

15 years 10 months ago

Download digital.cs.usu.edu

Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...

Hsiu-Hui Lee, Chung-Hsien Chen

claim paper

Read More »

146

click to vote

CIS
2005
Springer

129views Applied Computing» more CIS 2005»

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

15 years 9 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...

Jooyoung Park, Jongho Kim, Daesung Kang

claim paper

Read More »

149

click to vote

AMEC
2004
Springer

243views Intelligent Agents» more AMEC 2004»

Three Automated Stock-Trading Agents: A Comparative Study

15 years 9 months ago

Download userweb.cs.utexas.edu

Abstract. This paper documents the development of three autonomous stocktrading agents within the framework of the Penn Exchange Simulator (PXS), a novel stock-trading simulator th...

Alexander A. Sherstov, Peter Stone

claim paper

Read More »

« Prev « First page 213 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers