Search Sciweavers | Sciweavers

37 search results - page 4 / 8

» Analysis of Inverse Reinforcement Learning with Perturbed De...

click to vote

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

16 years 8 days ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

click to vote

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

15 years 6 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

105

click to vote

AAAI
1997

173views Intelligent Agents» more AAAI 1997»

The "Inverse Hollywood Problem": From Video to Scripts and Storyboards via Causal Analysis

15 years 25 days ago

Download www.cs.kent.edu

We address the problem of visually detecting causal events and tting them together into a coherent story of the action witnessed by the camera. We show that this can be done by re...

Matthew Brand

claim paper

Read More »

103

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

16 years 8 days ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

click to vote

GECCO
2006
Springer

208views Optimization» more GECCO 2006»

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

15 years 3 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

« Prev « First page 4 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers