Search Sciweavers | Sciweavers

656 search results - page 76 / 132

» Complexity of finite-horizon Markov decision process problem...

129

click to vote

ICN
2007
Springer

97views Computer Networks» more ICN 2007»

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

15 years 8 months ago

Download www.sce.carleton.ca

— In this paper, we use the Markov Decision Process (MDP) technique to ﬁnd the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...

claim paper

Read More »

135

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 3 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

UAI
2004

101views Artificial Intelligence» more UAI 2004»

Region-Based Incremental Pruning for POMDPs

15 years 3 months ago

Download anytime.cs.umass.edu

We present a major improvement to the incremental pruning algorithm for solving partially observable Markov decision processes. Our technique targets the cross-sum step of the dyn...

Zhengzhu Feng, Shlomo Zilberstein

claim paper

Read More »

139

click to vote

ISAAC
2010
Springer

243views Algorithms» more ISAAC 2010»

Lower Bounds for Howard's Algorithm for Finding Minimum Mean-Cost Cycles

14 years 12 months ago

Download www.daimi.au.dk

Howard's policy iteration algorithm is one of the most widely used algorithms for finding optimal policies for controlling Markov Decision Processes (MDPs). When applied to we...

Thomas Dueholm Hansen, Uri Zwick

claim paper

Read More »

151

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 8 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

« Prev « First page 76 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers