Search Sciweavers | Sciweavers

2011 search results - page 22 / 403

» Universal Reinforcement Learning

267

click to vote

ICAART
2010
INSTICC

509views Intelligent Agents» more ICAART 2010»

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 2 months ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

Read More »

162

click to vote

ATAL
2004
Springer

116views Intelligent Agents» more ATAL 2004»

Time-Extended Policies in Multi-Agent Reinforcement Learning

15 years 10 months ago

Download web.engr.oregonstate.edu

Many algorithms such as Q-learning successfully address reinforcement learning in single-agent multi-time-step problems. In addition there are methods that address reinforcement l...

Kagan Tumer, Adrian K. Agogino

claim paper

Read More »

104

click to vote

NIPS
2000

112views Information Technology» more NIPS 2000»

Balancing Multiple Sources of Reward in Reinforcement Learning

15 years 6 months ago

Download www.cc.gatech.edu

For many problems which would be natural for reinforcement learning, the reward signal is not a single scalar value but has multiple scalar components. Examples of such problems i...

Christian R. Shelton

claim paper

Read More »

112

click to vote

ML
2007
ACM

73views Machine Learning» more ML 2007»

Online calibrated forecasts: Memory efficiency versus universality for learning in games

15 years 4 months ago

Download www.cs.caltech.edu

Shie Mannor, Jeff S. Shamma, Gürdal Arslan

claim paper

Read More »

157

click to vote

GECCO
2006
Springer

133views Optimization» more GECCO 2006»

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 9 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

claim paper

Read More »

« Prev « First page 22 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers