Sciweavers

350 search results - page 53 / 70
COLT
2010
Springer
14 years 7 months ago
Robust Selective Sampling from Single and Multiple Teachers
We present a new online learning algorithm in the selective sampling framework, where labels must be actively queried before they are revealed. We prove bounds on the regret of ou...
Ofer Dekel, Claudio Gentile, Karthik Sridharan
ICRA
2010
IEEE
14 years 8 months ago
Reinforcement learning of motor skills in high dimensions: A path integral approach
Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...
Evangelos Theodorou, Jonas Buchli, Stefan Schaal
ALGOSENSORS
2009
Springer
15 years 4 months ago
Link Reversal: How to Play Better to Work Less
Sensor networks, with their ad hoc deployments, node mobility, and wireless communication, pose serious challenges for developing provably correct and efficient applications. A po...
Bernadette Charron-Bost, Jennifer L. Welch, Josef ...
SELMAS
2004
Springer
15 years 2 months ago
A Software Framework for Automated Negotiation
If agents are to negotiate automatically with one another they must share a negotiation mechanism, specifying what possible actions each party can take at any given time, when nego...
Claudio Bartolini, Chris Preist, Nicholas R. Jenni...
IDMS
2000
Springer
15 years 1 months ago
How to Keep a Dead Man from Shooting
The state-of-the-art approach to realize consistency in distributed virtual environments (e.g., action games, multi-user virtual reality, and battlefield simulations) is dead recko...
Martin Mauve