Sciweavers

4544 search results - page 147 / 909
» Reinforcement Learning with Time
ICAART
2011
INSTICC
14 years 7 months ago
Optimal Sample Selection for Batch-mode Reinforcement Learning
Emmanuel Rachelson, François Schnitzler, Lo...
SASO
2009
IEEE
15 years 10 months ago
Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems
Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...
Ivana Dusparic, Vinny Cahill
ICRA
2008
IEEE
15 years 10 months ago
Learning from human teachers with Socially Guided Exploration
We present a learning mechanism, Socially Guided Exploration, in which a robot learns new tasks through a combination of self-exploration and social interaction. The system’s...
Cynthia Breazeal, Andrea Lockerd Thomaz
ATAL
2009
Springer
15 years 10 months ago
Solving multiagent assignment Markov decision processes
We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called “Assignmen...
Scott Proper, Prasad Tadepalli