Sciweavers

1236 search results - page 213 / 248
» Opposition-Based Reinforcement Learning
Sort
View
ICRA
2007
IEEE
155views Robotics» more  ICRA 2007»
15 years 10 months ago
Value Function Approximation on Non-Linear Manifolds for Robot Motor Control
— The least squares approach works efficiently in value function approximation, given appropriate basis functions. Because of its smoothness, the Gaussian kernel is a popular an...
Masashi Sugiyama, Hirotaka Hachiya, Christopher To...
ICANN
2007
Springer
15 years 10 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...
CIMCA
2006
IEEE
15 years 10 months ago
Multi-Agent Coalition Formation for Long-Term Task or Mobile Network
Coalition formation is a process to form a group and solve a problem via cooperation. Because of the rising of network, each computing device can communicate through network. We c...
Hsiu-Hui Lee, Chung-Hsien Chen
CIS
2005
Springer
15 years 9 months ago
An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm
Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...
Jooyoung Park, Jongho Kim, Daesung Kang
AMEC
2004
Springer
15 years 9 months ago
Three Automated Stock-Trading Agents: A Comparative Study
Abstract. This paper documents the development of three autonomous stocktrading agents within the framework of the Penn Exchange Simulator (PXS), a novel stock-trading simulator th...
Alexander A. Sherstov, Peter Stone