Sciweavers

4544 search results - page 43 / 909

» Reinforcement Learning with Time

105

EWRL
2008

133views Machine Learning» more EWRL 2008»

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

15 years 5 months ago

Exploiting Additive Structure in Factored MDPs for Reinforcement Learning

Download ewrl08.futurs.inria.fr

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

99

Voted

ML
2008
ACM

95views Machine Learning» more ML 2008»

Transfer in variable-reward hierarchical reinforcement learning

15 years 3 months ago

Transfer in variable-reward hierarchical reinforcement learning

Download web.engr.oregonstate.edu

Neville Mehta, Sriraam Natarajan, Prasad Tadepalli...

claim paper

Read More »

92

ML
2000
ACM

133views Machine Learning» more ML 2000»

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

15 years 3 months ago

Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms

Download www.cs.rutgers.edu

Satinder P. Singh, Tommi Jaakkola, Michael L. Litt...

claim paper

Read More »

223

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 28 days ago

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

88

NIPS
2000

112views Information Technology» more NIPS 2000»

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 5 months ago

Balancing Multiple Sources of Reward in Reinforcement Learning

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

claim paper

Read More »

« Prev « First page 43 / 909 Last » Next »