Sciweavers

181 search results - page 28 / 37
» On Policy Learning in Restricted Policy Spaces
Sort
View
ICML
1996
IEEE
15 years 3 months ago
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Rémi Munos
87
Voted
ICML
2004
IEEE
16 years 14 days ago
Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning
Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...
Matthew R. Rudary, Satinder P. Singh, Martha E. Po...
96
Voted
ECML
2004
Springer
15 years 5 months ago
Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework
Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...
Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...
ECML
2007
Springer
15 years 5 months ago
Imitation Learning Using Graphical Models
Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imit...
Deepak Verma, Rajesh P. N. Rao
100
Voted
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
15 years 5 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...