Sciweavers

779 search results - page 15 / 156
» Reinforcement Using Supervised Learning for Policy Generaliz...
Sort
View
ICMLA
2008
15 years 20 days ago
Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture
In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...
Sertan Girgin, Philippe Preux
NECO
2010
97views more  NECO 2010»
14 years 9 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
KES
2007
Springer
15 years 5 months ago
Making Financial Trading by Recurrent Reinforcement Learning
In this paper we propose a financial trading system whose strategy is developed by means of an artificial neural network approach based on a recurrent reinforcement learning algo...
Francesco Bertoluzzo, Marco Corazza
INTERSPEECH
2010
14 years 6 months ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young