Sciweavers

250 search results - page 23 / 50
» Learning action effects in partially observable domains
ECAI
2010
Springer
14 years 10 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo
CONNECTION
2008
14 years 9 months ago
Spoken language interaction with model uncertainty: an adaptive human-robot interaction system
Spoken language is one of the most intuitive forms of interaction between humans and agents. Unfortunately, agents that interact with people using natural language often experienc...
Finale Doshi, Nicholas Roy
AAAI
2000
14 years 11 months ago
Back to the Future for Consistency-Based Trajectory Tracking
Given a model of a physical process and a sequence of commands and observations received over time, the task of an autonomous controller is to determine the likely states of the p...
James Kurien, P. Pandurang Nayak
ICRA
2009
IEEE
15 years 4 months ago
Automatic weight learning for multiple data sources when learning from demonstration
Traditional approaches to programming robots are generally inaccessible to non-robotics-experts. A promising exception is the Learning from Demonstration paradigm. Here a polic...
Brenna Argall, Brett Browning, Manuela M. Veloso
UAI
2001
14 years 11 months ago
The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...
Lex Weaver, Nigel Tao