Sciweavers

4544 search results - page 43 / 909
» Reinforcement Learning with Time
Sort
View
EWRL
2008
14 years 11 months ago
Exploiting Additive Structure in Factored MDPs for Reinforcement Learning
Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...
ML
2008
ACM
14 years 9 months ago
Transfer in variable-reward hierarchical reinforcement learning
Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...
ML
2000
ACM
133views Machine Learning» more  ML 2000»
14 years 9 months ago
Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms
Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...
ICAART
2010
INSTICC
15 years 6 months ago
Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning
There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...
Christos Dimitrakakis
NIPS
2000
14 years 11 months ago
Balancing Multiple Sources of Reward in Reinforcement Learning
For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...
Christian R. Shelton