Search Sciweavers | Sciweavers

9841 search results - page 9 / 1969

» Distributed Value Functions

148

Voted

ATAL
2007
Springer

142views Intelligent Agents» more ATAL 2007»

Q-value functions for decentralized POMDPs

15 years 10 months ago

Download www.science.uva.nl

Planning in single-agent models like MDPs and POMDPs can be carried out by resorting to Q-value functions: a (near-) optimal Q-value function is computed in a recursive manner by ...

Frans A. Oliehoek, Nikos A. Vlassis

claim paper

Read More »

143

click to vote

JMLR
2008

129views more JMLR 2008»

Finite-Time Bounds for Fitted Value Iteration

15 years 4 months ago

Download www.sztaki.hu

In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...

Rémi Munos, Csaba Szepesvári

claim paper

Read More »

131

click to vote

INFOVIS
2000
IEEE

129views Visualization» more INFOVIS 2000»

Density Functions for Visual Attributes and Effective Partitioning in Graph Visualization

15 years 8 months ago

Download homepages.cwi.nl

Two tasks in Graph Visualization require partitioning: the assignment of visual attributes and divisive clustering. Often, we would like to assign a color or other visual attribut...

Ivan Herman, M. Scott Marshall, Guy Melanço...

claim paper

Read More »

139

click to vote

AAAI
2006

126views Intelligent Agents» more AAAI 2006»

Functional Value Iteration for Decision-Theoretic Planning with General Utility Functions

15 years 5 months ago

Download www.aaai.org

We study how to find plans that maximize the expected total utility for a given MDP, a planning objective that is important for decision making in high-stakes domains. The optimal...

Yaxin Liu, Sven Koenig

claim paper

Read More »

142

click to vote

ESANN
2008

164views Neural Networks» more ESANN 2008»

Multilayer Perceptrons with Radial Basis Functions as Value Functions in Reinforcement Learning

15 years 5 months ago

Download www.dice.ucl.ac.be

Using multilayer perceptrons (MLPs) to approximate the state-action value function in reinforcement learning (RL) algorithms could become a nightmare due to the constant possibilit...

Victor Uc Cetina

claim paper

Read More »

« Prev « First page 9 / 1969 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers