Search Sciweavers | Sciweavers

148 search results - page 16 / 30

» Reinforcement Learning for P2P Searching

148

click to vote

RAS
2000

161views more RAS 2000»

Active object recognition by view integration and reinforcement learning

15 years 3 months ago

Download www.emt.tu-graz.ac.at

A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. Active recognition of three-dimensional objects involves the observer in a sear...

Lucas Paletta, Axel Pinz

claim paper

Read More »

131

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 5 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

131

click to vote

IJCAI
2007

140views Artificial Intelligence» more IJCAI 2007»

Utile Distinctions for Relational Reinforcement Learning

15 years 5 months ago

Download www.ijcai.org

We introduce an approach to autonomously creating state space abstractions for an online reinforcement learning agent using a relational representation. Our approach uses a tree-b...

William Dabney, Amy McGovern

claim paper

Read More »

158

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 5 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

138

Voted

ICANN
2010
Springer

166views Neural Networks» more ICANN 2010»

Exploring Continuous Action Spaces with Diffusion Trees for Reinforcement Learning

15 years 5 months ago

Download www.tu-ilmenau.de

We propose a new approach for reinforcement learning in problems with continuous actions. Actions are sampled by means of a diffusion tree, which generates samples in the continuou...

Christian Vollmer, Erik Schaffernicht, Horst-Micha...

claim paper

Read More »

« Prev « First page 16 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers