Search Sciweavers | Sciweavers

87 search results - page 2 / 18

» A policy iteration algorithm for Markov decision processes s...

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

13 years 3 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

13 years 11 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

ISCC
2000
IEEE

104views Communications» more ISCC 2000»

Dynamic Routing and Wavelength Assignment Using First Policy Iteration

13 years 9 months ago

Download www.netlab.tkk.fi

With standard assumptions the routing and wavelength assignment problem (RWA) can be viewed as a Markov Decision Process (MDP). The problem, however, deﬁes an exact solution bec...

Esa Hyytiä, Jorma T. Virtamo

claim paper

Read More »

click to vote

ECAI
2010
Springer

227views Artificial Intelligence» more ECAI 2010»

On Finding Compromise Solutions in Multiobjective Markov Decision Processes

13 years 6 months ago

Download www-desir.lip6.fr

A Markov Decision Process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDP to address multicriteria or multiagen...

Patrice Perny, Paul Weng

claim paper

Read More »

click to vote

CORR
2007
Springer

94views Education» more CORR 2007»

Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm

13 years 5 months ago

Download www.ieee-infocom.org

— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...

Bruce Hajek, Kevin Mitzel, Sichao Yang

claim paper

Read More »

« Prev « First page 2 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers