Sciweavers

1233 search results - page 202 / 247
» Reinforcement Learning in MirrorBot
Sort
View
COMCOM
2008
127views more  COMCOM 2008»
14 years 9 months ago
A dynamic routing protocol for keyword search in unstructured peer-to-peer networks
The idea of building query-oriented routing indices has changed the way of improving keyword search efficiency from the basis as it can learn the content distribution from the que...
Cong Shi, Dingyi Han, Yuanjie Liu, Shicong Meng, Y...
SMC
2007
IEEE
102views Control Systems» more  SMC 2007»
15 years 4 months ago
An improved immune Q-learning algorithm
—Reinforcement learning is a framework in which an agent can learn behavior without knowledge on a task or an environment by exploration and exploitation. Striking a balance betw...
Zhengqiao Ji, Q. M. Jonathan Wu, Maher A. Sid-Ahme...
IROS
2006
IEEE
113views Robotics» more  IROS 2006»
15 years 3 months ago
Policy Gradient Methods for Robotics
— The aquisition and improvement of motor skills and control policies for robotics from trial and error is of essential importance if robots should ever leave precisely pre-struc...
Jan Peters, Stefan Schaal
ECML
2005
Springer
15 years 3 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
ICALT
2005
IEEE
15 years 3 months ago
Can Collaborative Technologies Improve Management Education?
This paper explores the potential impact of collaborative technologies on improving management education. The first goal is to expose students to tools and practices that not only...
Marie-Noëlle Bessagnet, Lee Schlenker, Robert...