Sciweavers

1233 search results - page 159 / 247
» Reinforcement learning
Sort
View
AGI
2008
15 years 5 months ago
An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life
A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...
Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...
BC
1998
109views more  BC 1998»
15 years 3 months ago
Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity
Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are dicult to achieve because the agents incur ...
Javier Zamora, José del R. Millán, A...
ROBOCUP
2000
Springer
104views Robotics» more  ROBOCUP 2000»
15 years 7 months ago
Essex Wizards 2000 Team Description
: This article gives an overview of the Essex Wizards 2000 team participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...
Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...
ESANN
2008
15 years 5 months ago
Similarities and differences between policy gradient methods and evolution strategies
Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...
Verena Heidrich-Meisner, Christian Igel
NIPS
2007
15 years 5 months ago
Stable Dual Dynamic Programming
Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...