Markov decision processes

17

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

11 years 6 months ago

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

25

click to vote

CORR
2012
Springer

286views Education» more CORR 2012»

A Faster Algorithm for Solving One-Clock Priced Timed Games

12 years 1 days ago

Download www.daimi.au.dk

One-clock priced timed games is a class of two-player, zero-sum, continuous-time games that was deﬁned and thoroughly studied in previous works. We show that One-clock priced ti...

Thomas Dueholm Hansen, Rasmus Ibsen-Jensen, Peter ...

claim paper

Read More »

33

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

12 years 2 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

33

click to vote

Publication

233views

Sparse reward processes

12 years 2 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

14

click to vote

ATAL
2011
Springer

169views Intelligent Agents» more ATAL 2011»

Towards a unifying characterization for quantifying weak coupling in dec-POMDPs

12 years 4 months ago

Download ai.eecs.umich.edu

Researchers in the ﬁeld of multiagent sequential decision making have commonly used the terms “weakly-coupled” and “loosely-coupled” to qualitatively classify problems i...

Stefan J. Witwicki, Edmund H. Durfee

claim paper

Read More »

15

click to vote

IWQOS
2011
Springer

230views Communications» more IWQOS 2011»

An MDP-based admission control for a QoS-aware service-oriented system

12 years 7 months ago

Download www.ce.uniroma2.it

In this paper, we address the problem of providing a service broker, which offers to prospective users a composite service with a range of different Quality of Service (QoS) class...

Marco Abundo, Valeria Cardellini, Francesco Lo Pre...

claim paper

Read More »

15

click to vote

AIPS
2011

233views Artificial Intelligence» more AIPS 2011»

Sample-Based Planning for Continuous Action Markov Decision Processes

12 years 8 months ago

Download www.chrismansley.com

In this paper, we present a new algorithm that integrates recent advances in solving continuous bandit problems with sample-based rollout methods for planning in Markov Decision P...

Christopher R. Mansley, Ari Weinstein, Michael L. ...

claim paper

Read More »

11

click to vote

AAAI
1994

117views Intelligent Agents» more AAAI 1994»

Control Strategies for a Stochastic Planner

13 years 5 months ago

Download alumnus.caltech.edu

We present new algorithms for local planning over Markov decision processes. The base-level algorithm possesses several interesting features for control of computation, based on s...

Jonathan Tash, Stuart J. Russell

claim paper

Read More »

12

click to vote

UAI
2000

168views Artificial Intelligence» more UAI 2000»

The Complexity of Decentralized Control of Markov Decision Processes

13 years 5 months ago

Download www.cs.umass.edu

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalization...

Daniel S. Bernstein, Shlomo Zilberstein, Neil Imme...

claim paper

Read More »

13

click to vote

UAI
1998

91views Artificial Intelligence» more UAI 1998»

Hierarchical Solution of Markov Decision Processes using Macro-actions

13 years 5 months ago

Download www.cs.toronto.edu

tigate the use of temporally abstract actions, or macro-actions, in the solution of Markov decision processes. Unlike current models that combine both primitive actions and macro-...

Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kae...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers