TOMACS
2010
A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm
In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...
Sumit Kunnumkal, Huseyin Topaloglu
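The Q-learning algorithm that this paper enhances can be illustrated with a minimal tabular sketch. The two-state chain MDP below is a made-up example, and only the standard temporal-difference update is shown; the paper's max-norm projection step is not reproduced here.

```python
import random

def q_learning(num_episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning on a toy two-state chain MDP (illustrative only)."""
    rng = random.Random(seed)
    # States 0 and 1; action 0 = "stay", action 1 = "move".
    # Moving from state 0 to state 1 yields reward 1; everything else 0.
    q = {(s, a): 0.0 for s in (0, 1) for a in (0, 1)}
    for _ in range(num_episodes):
        s = 0
        for _ in range(10):
            if rng.random() < epsilon:
                a = rng.choice((0, 1))                      # explore
            else:
                a = max((0, 1), key=lambda x: q[(s, x)])    # exploit
            s_next = 1 if a == 1 else s
            r = 1.0 if (s == 0 and a == 1) else 0.0
            # Temporal-difference update toward r + gamma * max_a' Q(s', a').
            target = r + gamma * max(q[(s_next, 0)], q[(s_next, 1)])
            q[(s, a)] += alpha * (target - q[(s, a)])
            s = s_next
    return q

q = q_learning()
```

After training, the learned values reflect that "move" is the better action from state 0.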
JMLR
2010
Finite-sample Analysis of Bellman Residual Minimization
We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is avai...
Odalric-Ambrym Maillard, Rémi Munos, Alessa...
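The Bellman residual objective the abstract refers to can be illustrated in the simplest possible setting: policy evaluation on a known two-state Markov chain (the transition matrix and rewards below are made-up numbers), minimizing the squared residual by gradient descent. The paper's finite-sample analysis concerns the sample-based version of this objective under a generative model, which this sketch does not attempt.

```python
# Bellman residual minimization for policy evaluation:
# minimize 0.5 * ||V - (r + gamma * P V)||^2 over the tabular V.

gamma = 0.9
P = [[0.5, 0.5],
     [0.2, 0.8]]      # transition matrix under a fixed policy (toy numbers)
r = [1.0, 0.0]        # expected one-step rewards (toy numbers)

def residual(V):
    # residual[s] = V[s] - (r[s] + gamma * sum_t P[s][t] * V[t])
    return [V[s] - (r[s] + gamma * sum(P[s][t] * V[t] for t in range(2)))
            for s in range(2)]

V = [0.0, 0.0]
step = 0.5
for _ in range(5000):
    res = residual(V)
    # Gradient of the squared residual: (I - gamma * P)^T res.
    grad = [res[s] - gamma * sum(P[t][s] * res[t] for t in range(2))
            for s in range(2)]
    V = [V[s] - step * grad[s] for s in range(2)]
```

At the minimum the residual vanishes, so V coincides with the exact fixed point (I - gamma P)^-1 r.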
ICMLA
2009
Multiagent Transfer Learning via Assignment-Based Decomposition
We describe a system that successfully transfers value function knowledge across multiple subdomains of realtime strategy games in the context of multiagent reinforcement learning....
Scott Proper, Prasad Tadepalli
ICRA
2010
IEEE
Multirobot coordination by auctioning POMDPs
We consider the problem of task assignment and execution in multirobot systems by proposing a procedure for bid estimation in auction protocols. Auctions are of interest to mu...
Matthijs T. J. Spaan, Nelson Gonçalves, Jo&...
MP
2006
Two-stage integer programs with stochastic right-hand sides: a superadditive dual approach
We consider two-stage pure integer programs with discretely distributed stochastic right-hand sides. We present an equivalent superadditive dual formulation that uses the value fun...
Nan Kong, Andrew J. Schaefer, Brady Hunsaker
AI
2008
Springer
Graphically structured value-function compilation
Classical work on eliciting and representing preferences over multi-attribute alternatives has attempted to recognize conditions under which value functions take on particularly s...
Ronen I. Brafman, Carmel Domshlak
ICML
2010
IEEE
Inverse Optimal Control with Linearly-Solvable MDPs
We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearly-solvable MDPs (LMDPs). Unlike most prior IRL algorit...
Dvijotham Krishnamurthy, Emanuel Todorov
NIPS
2000
APRICODD: Approximate Policy Construction Using Decision Diagrams
We propose a method of approximate dynamic programming for Markov decision processes (MDPs) using algebraic decision diagrams (ADDs). We produce near-optimal value functions and p...
Robert St-Aubin, Jesse Hoey, Craig Boutilier
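APRICODD's contribution is to represent value functions compactly as algebraic decision diagrams. As a point of contrast, the flat dynamic programming it improves on can be sketched as tabular value iteration on a made-up two-state MDP, with no ADD machinery involved:

```python
# Flat value iteration: V(s) <- max_a sum_{s'} p(s'|s,a) [r + gamma V(s')].
# The MDP below is a toy example, not taken from the paper.

gamma = 0.9
# transitions[s][a] = list of (prob, next_state, reward)
transitions = {
    0: {0: [(1.0, 0, 0.0)],
        1: [(0.8, 1, 1.0), (0.2, 0, 0.0)]},
    1: {0: [(1.0, 1, 0.5)],
        1: [(1.0, 0, 0.0)]},
}

V = {0: 0.0, 1: 0.0}
for _ in range(200):
    V = {s: max(sum(p * (rew + gamma * V[sn]) for p, sn, rew in outs)
                for outs in transitions[s].values())
         for s in V}
```

Value iteration contracts at rate gamma, so 200 sweeps leave V essentially at the optimal fixed point; an ADD representation would instead merge states with identical values into shared subgraphs.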
DAGSTUHL
2008
Interactive Multiobjective Optimization Using a Set of Additive Value Functions
In this chapter, we present a new interactive procedure for multiobjective optimization, which is based on the use of a set of value functions as a preference model built...
José Rui Figueira, Salvatore Greco, Vincent...
GECCO
2010
Springer
Multi-task evolutionary shaping without pre-specified representations
Shaping functions can be used in multi-task reinforcement learning (RL) to incorporate knowledge from previously experienced tasks to speed up learning on a new task. So far, rese...
Matthijs Snel, Shimon Whiteson