Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

13 years 5 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and aﬀective systems by building artiﬁcial agents that share the natural biological constraints: self-preservation (foraging) and self-reproduction (mating). This paper shows a method to evolve an agent’s exploratory reward by combining a framework of embodied evolution and the algorithm of constrained policy gradient reinforcement learning. Biological constraints are modeled by the average criteria, and the exploratory reward is computed from its own sensor information. The agent in which a part of constraints are satisﬁed is allowed to mate with another agent. If a mating behavior is successfully made between two agents, one of genetic operations is applied according to ﬁtness values to improve the exploratory rewards. Through learning and embodied evolution, a group of agents obtain appropriate exploratory rewards.

Eiji Uchibe, Kenji Doya

Real-time Traffic

Agent’s Exploratory Reward | Biological Constraints | Exploratory Rewards | ICONIP 2007 | Information Technology |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2007
Where	ICONIP
Authors	Eiji Uchibe, Kenji Doya

Comments (0)

Sciweavers

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

Agent’s Exploratory Reward | Biological Constraints | Exploratory Rewards | ICONIP 2007 | Information Technology |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers