Search Sciweavers | Sciweavers

AGI
2008

136 views · Artificial Intelligence

An Integrative Methodology for Teaching Embodied Non-Linguistic Agents, Applied to Virtual Animals in Second Life

15 years 7 months ago

A teaching methodology called Imitative-Reinforcement-Corrective (IRC) learning is described, and proposed as a general approach for teaching embodied non-linguistic AGI systems. I...

Ben Goertzel, Cassio Pennachin, Nil Geisweiller, M...

BC
1998

109 views

Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity

15 years 5 months ago

Download lis.epfl.ch

Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are difficult to achieve because the agents incur ...

Javier Zamora, José del R. Millán, A...

ROBOCUP
2000
Springer

104 views · Robotics

Essex Wizards 2000 Team Description

15 years 9 months ago

Download cswww.essex.ac.uk

This article gives an overview of the Essex Wizards 2000 team, which participated in the RoboCup 2000 simulator league. A brief description of the agent architecture for the team is int...

Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Kos...

ESANN
2008

115 views · Neural Networks

Similarities and differences between policy gradient methods and evolution strategies

15 years 7 months ago

Download www.dice.ucl.ac.be

Natural policy gradient methods and the covariance matrix adaptation evolution strategy, two variable metric methods proposed for solving reinforcement learning tasks, are contrast...

Verena Heidrich-Meisner, Christian Igel

NIPS
2007

80 views · Information Technology

Stable Dual Dynamic Programming

15 years 7 months ago

Download webdocs.cs.ualberta.ca

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...
