value function | Sciweavers

AI
2006
Springer

Belief Selection in Point-Based Planning Algorithms for POMDPs

13 years 8 months ago

Abstract. Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value funct...

Masoumeh T. Izadi, Doina Precup, Danielle Azar

ICML
1996
IEEE

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

13 years 8 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

ICML
2003
IEEE

The Significance of Temporal-Difference Learning in Self-Play Training: TD-rummy versus EVO-rummy

13 years 9 months ago

Download www.hpl.hp.com

Reinforcement learning has been used for training game playing agents. The value function for a complex game must be approximated with a continuous function because the number of ...

Clifford Kotnik, Jugal K. Kalita

SAINT
2003
IEEE

A Generalized Target-Driven Cache Replacement Policy for Mobile Environments

13 years 9 months ago

Download www.cs.iastate.edu

Caching frequently accessed data items on the client side is an effective technique to improve the system performance in wireless networks. Due to cache size limitations, cache re...

Liangzhong Yin, Guohong Cao, Ying Cai

CCIA
2005
Springer

Direct Policy Search Reinforcement Learning for Robot Control

13 years 10 months ago

Download vicorob.udg.es

This paper proposes a high-level Reinforcement Learning (RL) control system for solving the action selection problem of an autonomous robot. Although the dominant approach, whe...

Andres El-Fakdi, Marc Carreras, Narcís Palo...

ATAL
2005
Springer

Behavior transfer for value-function-based reinforcement learning

13 years 10 months ago

Download www.cs.huji.ac.il

Temporal difference (TD) learning methods [22] have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been...

Matthew E. Taylor, Peter Stone

ICML
2006
IEEE

Automatic basis function construction for approximate dynamic programming and reinforcement learning

13 years 10 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

ATAL
2007
Springer

On opportunistic techniques for solving decentralized Markov decision processes with temporal constraints

13 years 10 months ago

Download www.aamas-conference.org

Decentralized Markov Decision Processes (DEC-MDPs) are a popular model of agent-coordination problems in domains with uncertainty and time constraints but very difficult to solve...

Janusz Marecki, Milind Tambe

IROS
2009
IEEE

Active learning using mean shift optimization for robot grasping

13 years 11 months ago

Download www.kyb.tuebingen.mpg.de

When children learn to grasp a new object, they often know several possible grasping points from observing a parent’s demonstration and subsequently learn better grasps by tr...

Oliver Kroemer, Renaud Detry, Justus H. Piater, Ja...

ICRA
2009
IEEE

Adaptive autonomous control using online value iteration with Gaussian processes

13 years 11 months ago

Download www-personal.acfr.usyd.edu.au

In this paper, we present a novel approach to controlling a robotic system online from scratch based on the reinforcement learning principle. In contrast to other approaches, o...

Axel Rottmann, Wolfram Burgard
