Search Sciweavers | Sciweavers

397 search results - page 27 / 80

» Reinforcement Learning with Hierarchies of Machines

101

click to vote

ICML
1994
IEEE

152views Machine Learning» more ICML 1994»

Markov Games as a Framework for Multi-Agent Reinforcement Learning

15 years 3 months ago

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function....

Michael L. Littman

claim paper

Read More »

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

14 years 11 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

AIIDE
2006

123views Artificial Intelligence» more AIIDE 2006»

The Self Organization of Context for Learning in MultiAgent Games

15 years 1 months ago

Download www.aaai.org

Reinforcement learning is an effective machine learning paradigm in domains represented by compact and discrete state-action spaces. In high-dimensional and continuous domains, ti...

Christopher D. White, Dave Brogan

claim paper

Read More »

100

click to vote

ICML
2005
IEEE

93views Machine Learning» more ICML 2005»

Relating reinforcement learning performance to classification performance

16 years 19 days ago

Download hunch.net

We prove a quantitative connection between the expected sum of rewards of a policy and binary classification performance on created subproblems. This connection holds without any ...

John Langford, Bianca Zadrozny

claim paper

Read More »

106

click to vote

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

16 years 19 days ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

« Prev « First page 27 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers