Sciweavers

ICMLA
2008
15 years 2 months ago
Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Architecture
In reinforcement learning, it is a common practice to map the state(-action) space to a different one using basis functions. This transformation aims to represent the input data i...
Sertan Girgin, Philippe Preux
NECO
2010
14 years 11 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
KES
2007
Springer
15 years 6 months ago
Making Financial Trading by Recurrent Reinforcement Learning
In this paper we propose a financial trading system whose strategy is developed by means of an artificial neural network approach based on a recurrent reinforcement learning algo...
Francesco Bertoluzzo, Marco Corazza
INTERSPEECH
2010
14 years 7 months ago
Still talking to machines (cognitively speaking)
This overview article reviews the structure of a fully statistical spoken dialogue system (SDS), using as illustration, various systems and components built at Cambridge over the ...
Steve Young