Search Sciweavers | Sciweavers

50 search results - page 1 / 10

» Nonparametric Return Distribution Approximation for Reinforc...

127

Voted

ICML
2010
IEEE

189views Machine Learning» more ICML 2010»

Nonparametric Return Distribution Approximation for Reinforcement Learning

15 years 5 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

claim paper

Read More »

129

click to vote

P2P
2006
IEEE

101views Communications» more P2P 2006»

Reinforcement Learning for Query-Oriented Routing Indices in Unstructured Peer-to-Peer Networks

15 years 10 months ago

Download www.cc.gatech.edu

The idea of building query-oriented routing indices has changed the way of improving routing efﬁciency from the basis as it can learn the content distribution during the query r...

Cong Shi, Shicong Meng, Yuanjie Liu, Dingyi Han, Y...

claim paper

Read More »

146

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 5 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

131

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 5 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

162

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 5 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

« Prev « First page 1 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers