Search Sciweavers | Sciweavers

3412 search results - page 82 / 683

» Efficient Reinforcement Learning

147

Voted

ATAL
2007
Springer

122views Intelligent Agents» more ATAL 2007»

Reducing the complexity of multiagent reinforcement learning

15 years 9 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

152

click to vote

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

15 years 1 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

101

Voted

CSE
2009
IEEE

85views Theoretical Computer Science» more CSE 2009»

Reinforcement Learning of Listener Response for Mood Classification of Audio

15 years 10 months ago

Download www.oddible.com

This paper describes a method of applying a reinforcement learning artificial intelligence to categorize audio files by mood based on listener response during a performance. The s...

Jack Stockholm, Philippe Pasquier

claim paper

Read More »

106

Voted

AI
2006
Springer

103views Artificial Intelligence» more AI 2006»

Trace Equivalence Characterization Through Reinforcement Learning

15 years 7 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...

claim paper

Read More »

Voted

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 5 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

« Prev « First page 82 / 683 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers