Search Sciweavers | Sciweavers

226 search results - page 17 / 46

» Linear Bayesian Reinforcement Learning

115

click to vote

ECAI
2008
Springer

83views Artificial Intelligence» more ECAI 2008»

Reinforcement Learning with the Use of Costly Features

15 years 4 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

claim paper

Read More »

156

click to vote

TSP
2011

230views more TSP 2011»

Bayesian Nonparametric Inference of Switching Dynamic Linear Models

14 years 9 months ago

Download web.mit.edu

—Many complex dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switc...

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan,...

claim paper

Read More »

114

click to vote

ATAL
2008
Springer

99views Intelligent Agents» more ATAL 2008»

Non-linear dynamics in multiagent reinforcement learning algorithms

15 years 4 months ago

Download www.aamas-conference.org

Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Only a subset of these MARL algorithms both do not require agent...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

132

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 1 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

107

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Standard and averaging reinforcement learning in XCS

15 years 6 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

claim paper

Read More »

« Prev « First page 17 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers