Sciweavers

1310 search results - page 174 / 262
» Progressive Optimization in Action
Sort
View
ICRA
2007
IEEE
161views Robotics» more  ICRA 2007»
15 years 11 months ago
Manipulation Planning Among Movable Obstacles
— This paper presents the ResolveSpatialConstraints (RSC) algorithm for manipulation planning in a domain with movable obstacles. Empirically we show that our algorithm quickly g...
Mike Stilman, Jan-Ullrich Schamburek, James Kuffne...
ICRA
2007
IEEE
134views Robotics» more  ICRA 2007»
15 years 11 months ago
Towards a Real-Time Bayesian Imitation System for a Humanoid Robot
Abstract— Imitation learning, or programming by demonstration (PbD), holds the promise of allowing robots to acquire skills from humans with domain-specific knowledge, who nonet...
Aaron P. Shon, Joshua J. Storz, Rajesh P. N. Rao
IROS
2007
IEEE
122views Robotics» more  IROS 2007»
15 years 11 months ago
Design and experimental evaluation of rotor speed regulators for model helicopters in a test bench
— This work is focused on the design of a rotor speed regulator for small-scale helicopters. A nominal model identification for the main rotor is performed using a hybrid proced...
Alexandre Simões Martins, Antônio Pad...
ATAL
2007
Springer
15 years 11 months ago
Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective
This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...
Liviu Panait, Karl Tuyls
106
Voted
ECML
2007
Springer
15 years 11 months ago
Safe Q-Learning on Complete History Spaces
In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...
Stephan Timmer, Martin Riedmiller