Sciweavers

8 search results - page 2 / 2
» Doubly Robust Policy Evaluation and Learning
Sort
View
INFOCOM
2009
IEEE
13 years 11 months ago
Analysis of Adaptive Incentive Protocols for P2P Networks
— Incentive protocols play a crucial role to encourage cooperation among nodes in networking applications. The aim of this paper is to provide a general analytical framework to a...
Ben Q. Zhao, John C. S. Lui, Dah-Ming Chiu
ICRA
2002
IEEE
141views Robotics» more  ICRA 2002»
13 years 9 months ago
Movement Imitation with Nonlinear Dynamical Systems in Humanoid Robots
This article presents a new approach to movement planning, on-line trajectory modification, and imitation learning by representing movement plans based on a set of nonlinear di...
Auke Jan Ijspeert, Jun Nakanishi, Stefan Schaal
JMLR
2010
148views more  JMLR 2010»
12 years 11 months ago
A Generalized Path Integral Control Approach to Reinforcement Learning
With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal