Sciweavers

139 search results - page 27 / 28
» The Introspective Robot: Using Self-Prediction to Improve Ro...
Sort
View
ECML
2005
Springer
13 years 11 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
FLAIRS
2008
13 years 8 months ago
Learning in the Lexical-Grammatical Interface
Children are facile at both discovering word boundaries and using those words to build higher-level structures in tandem. Current research treats lexical acquisition and grammar i...
Tom Armstrong, Tim Oates
HRI
2007
ACM
13 years 9 months ago
Efficient model learning for dialog management
Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...
Finale Doshi, Nicholas Roy
IROS
2007
IEEE
143views Robotics» more  IROS 2007»
13 years 12 months ago
Metrics for quantifying system performance in intelligent, fault-tolerant multi-robot teams
— Any system that has the capability to diagnose and recover from faults is considered to be a fault-tolerant system. Additionally, the quality of the incorporated fault-toleranc...
Balajee Kannan, Lynne E. Parker
AAAI
2008
13 years 8 months ago
Make3D: Depth Perception from a Single Still Image
Humans have an amazing ability to perceive depth from a single still image; however, it remains a challenging problem for current computer vision systems. In this paper, we will p...
Ashutosh Saxena, Min Sun, Andrew Y. Ng