Search Sciweavers | Sciweavers

162 search results - page 13 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

202

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

15 years 4 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

189

click to vote

UAI
2000

91views Artificial Intelligence» more UAI 2000»

Value-Directed Belief State Approximation for POMDPs

15 years 7 months ago

Download www.cs.uwaterloo.ca

We consider the problem belief-state monitoring for the purposes of implementing a policy for a partially-observable Markov decision process (POMDP), specifically how one might ap...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

177

click to vote

DATE
2008
IEEE

136views Hardware» more DATE 2008»

A Framework of Stochastic Power Management Using Hidden Markov Model

16 years 15 days ago

Download www.date-conference.com

- The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...

Ying Tan, Qinru Qiu

claim paper

Read More »

171

click to vote

SIGMETRICS
2000
ACM

105views Hardware» more SIGMETRICS 2000»

Using the exact state space of a Markov model to compute approximate stationary measures

15 years 10 months ago

Download www.cs.ucr.edu

We present a new approximation algorithm based on an exact representation of the state space S, using decision diagrams, and of the transition rate matrix R, using Kronecker algeb...

Andrew S. Miner, Gianfranco Ciardo, Susanna Donate...

claim paper

Read More »

187

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 6 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 13 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers