Search Sciweavers | Sciweavers

38 search results - page 7 / 8

» On the Convergence of Optimistic Policy Iteration

click to vote

CORR
2010
Springer

170views Education» more CORR 2010»

Global Optimization for Value Function Approximation

13 years 5 months ago

Download www.cs.umass.edu

Existing value function approximation methods have been successfully used in many applications, but they often lack useful a priori error bounds. We propose a new approximate bili...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

INFOCOM
1995
IEEE

122views Communications» more INFOCOM 1995»

Complexity of Gradient Projection Method for Optimal Routing in Data Networks

13 years 9 months ago

Download www.cs.ou.edu

—In this paper, we derive a time-complexity bound for the gradient projection method for optimal routing in data networks. This result shows that the gradient projection algorith...

Wei Kang Tsai, John K. Antonio, Garng M. Huang

claim paper

Read More »

click to vote

AIPS
2011

216views Artificial Intelligence» more AIPS 2011»

Heuristic Search for Generalized Stochastic Shortest Path MDPs

12 years 9 months ago

Download www.cs.washington.edu

Research in efﬁcient methods for solving inﬁnite-horizon MDPs has so far concentrated primarily on discounted MDPs and the more general stochastic shortest path problems (SSPs...

Andrey Kolobov, Mausam, Daniel S. Weld, Hector Gef...

claim paper

Read More »

click to vote

IPPS
2002
IEEE

125views Distributed And Parallel Com...» more IPPS 2002»

Optimal Remapping in Dynamic Bulk Synchronous Computations via a Stochastic Control Approach

13 years 10 months ago

Download www.ece.eng.wayne.edu

A bulk synchronous computation proceeds in phases that are separated by barrier synchronization. For dynamic bulk synchronous computations that exhibit varying phase-wise computat...

Gang George Yin, Cheng-Zhong Xu, Le Yi Wang

claim paper

Read More »

click to vote

UAI
2004

121views Artificial Intelligence» more UAI 2004»

Discretized Approximations for POMDP with Average Cost

13 years 7 months ago

Download web.mit.edu

In this paper, we propose a new lower approximation scheme for POMDP with discounted and average cost criterion. The approximating functions are determined by their values at a fi...

Huizhen Yu, Dimitri P. Bertsekas

claim paper

Read More »

« Prev « First page 7 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers