Sciweavers

43 search results - page 3 / 9
» Training Reinforcement Neurocontrollers Using the Polytope A...
Sort
View
ML
1998
ACM
101views Machine Learning» more  ML 1998»
13 years 6 months ago
Elevator Group Control Using Multiple Reinforcement Learning Agents
Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithmshave appeared that approximatedynamic programming on an ...
Robert H. Crites, Andrew G. Barto
NIPS
2008
13 years 7 months ago
Optimization on a Budget: A Reinforcement Learning Approach
Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...
Paul Ruvolo, Ian R. Fasel, Javier R. Movellan
ACL
2009
13 years 4 months ago
Reinforcement Learning for Mapping Instructions to Actions
In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...
S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...
ROMAN
2007
IEEE
134views Robotics» more  ROMAN 2007»
14 years 16 days ago
Learning Reward Modalities for Human-Robot-Interaction in a Cooperative Training Task
—This paper proposes a novel method of learning a users preferred reward modalities for human-robot interaction through solving a cooperative training task. A learning algorithm ...
Anja Austermann, Seiji Yamada
ICML
2007
IEEE
14 years 7 months ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained with using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...