Sciweavers

8 search results - page 1 / 2
» Dynamic Reward Shaping: Training a Robot by Voice
Sort
View
IBERAMIA
2010
Springer
13 years 3 months ago
Dynamic Reward Shaping: Training a Robot by Voice
Reinforcement Learning is commonly used for learning tasks in robotics, however, traditional algorithms can take very long training times. Reward shaping has been recently used to ...
Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...
AAAI
2000
13 years 6 months ago
Interactive Training for Synthetic Characters
Compelling synthetic characters must behave in ways that reflect their past experience and thus allow for individual personalization. We therefore need a method that allows charac...
Song-Yee Yoon, Robert C. Burke, Bruce Blumberg, Ge...
CEC
2009
IEEE
13 years 11 months ago
How robot morphology and training order affect the learning of multiple behaviors
— Automatically synthesizing behaviors for robots with articulated bodies poses a number of challenges beyond those encountered when generating behaviors for simpler agents. One ...
Joshua S. Auerbach, Josh C. Bongard
SPEAKERC
2007
Springer
312views Biometrics» more  SPEAKERC 2007»
13 years 11 months ago
Development of a Femininity Estimator for Voice Therapy of Gender Identity Disorder Clients
Abstract. This work describes the development of an automatic estimator of perceptual femininity (PF) of an input utterance using speaker verification techniques. The estimator wa...
Nobuaki Minematsu, Kyoko Sakuraba
AIPS
2010
13 years 7 months ago
When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters
Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...
Emma Brunskill