Search Sciweavers | Sciweavers

45 search results - page 2 / 9

» Efficient exploration through active learning for value func...

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

13 years 11 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

click to vote

Publication

334views

Rollout Sampling Approximate Policy Iteration

14 years 1 months ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

click to vote

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

13 years 11 months ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

ECAI
2008
Springer

83views Artificial Intelligence» more ECAI 2008»

Reinforcement Learning with the Use of Costly Features

13 years 6 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

claim paper

Read More »

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

13 years 6 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

« Prev « First page 2 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers