Sciweavers

1235 search results - page 84 / 247
» Reinforcement learning in a nutshell
Sort
View
RAS
2006
105views more  RAS 2006»
14 years 10 months ago
Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
A class of biped locomotion called Passive Dynamic Walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is s...
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura...
JSW
2007
112views more  JSW 2007»
14 years 10 months ago
The Challenge of Training New Architects: an Ontological and Reinforcement-Learning Methodology
— This paper describes the importance of new skilled architects in the discipline of Software and Enterprise Architecture. Architects are often idealized as super heroes with a l...
Anabel Fraga, Juan Lloréns
NN
2002
Springer
113views Neural Networks» more  NN 2002»
14 years 10 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
ACL
2009
14 years 8 months ago
Reinforcement Learning for Mapping Instructions to Actions
In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...
S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...
ICASSP
2011
IEEE
14 years 2 months ago
Bayesian reinforcement learning for POMDP-based dialogue systems
Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...
ShaoWei Png, Joelle Pineau