Sciweavers

827 search results - page 41 / 166
» Variational methods for Reinforcement Learning
NN (Neural Networks)
2007
Springer
15 years 1 day ago
Guiding exploration by pre-existing knowledge without modifying reward
Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...
Kary Främling
DSP
2007
15 years 15 days ago
Blind separation of nonlinear mixtures by variational Bayesian learning
Blind separation of sources from nonlinear mixtures is a challenging and often ill-posed problem. We present three methods for solving this problem: an improved nonlinear factor a...
Antti Honkela, Harri Valpola, Alexander Ilin, Juha...
ATAL
2007
Springer
15 years 6 months ago
Batch reinforcement learning in a complex domain
Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...
Shivaram Kalyanakrishnan, Peter Stone
ICML
2005
IEEE
16 years 1 month ago
Relating reinforcement learning performance to classification performance
We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...
John Langford, Bianca Zadrozny
AAAI
2006
15 years 1 months ago
Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping
Transfer learning concerns applying knowledge learned in one task (the source) to improve learning another related task (the target). In this paper, we use structure mapping, a ps...
Yaxin Liu, Peter Stone