Sciweavers

1235 search results - page 84 / 247
» Reinforcement learning in a nutshell
Sort
View
RAS
2006
105views more  RAS 2006»
15 years 4 months ago
Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
A class of biped locomotion called Passive Dynamic Walking (PDW) has been recognized to be efficient in energy consumption and a key to understand human walking. Although PDW is s...
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura...
JSW
2007
112views more  JSW 2007»
15 years 4 months ago
The Challenge of Training New Architects: an Ontological and Reinforcement-Learning Methodology
— This paper describes the importance of new skilled architects in the discipline of Software and Enterprise Architecture. Architects are often idealized as super heroes with a l...
Anabel Fraga, Juan Lloréns
NN
2002
Springer
113views Neural Networks» more  NN 2002»
15 years 4 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
ACL
2009
15 years 2 months ago
Reinforcement Learning for Mapping Instructions to Actions
In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...
S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...
ICASSP
2011
IEEE
14 years 8 months ago
Bayesian reinforcement learning for POMDP-based dialogue systems
Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...
ShaoWei Png, Joelle Pineau