Search Sciweavers | Sciweavers

9 search results - page 2 / 2

» Purely Epistemic Markov Decision Processes

click to vote

SODA
2010
ACM

190views Algorithms» more SODA 2010»

One-Counter Markov Decision Processes

14 years 2 months ago

Download www.fi.muni.cz

We study the computational complexity of some central analysis problems for One-Counter Markov Decision Processes (OC-MDPs), a class of finitely-presented, countable-state MDPs. O...

Tomas Brazdil, Vaclav Brozek, Kousha Etessami, Ant...

claim paper

Read More »

click to vote

TACAS
2007
Springer

165views Algorithms» more TACAS 2007»

Multi-objective Model Checking of Markov Decision Processes

13 years 11 months ago

Download qav.comlab.ox.ac.uk

We study and provide eﬃcient algorithms for multi-objective model checking problems for Markov Decision Processes (MDPs). Given an MDP, M, and given multiple linear-time (ω-regu...

Kousha Etessami, Marta Z. Kwiatkowska, Moshe Y. Va...

claim paper

Read More »

click to vote

ECAI
2008
Springer

114views Artificial Intelligence» more ECAI 2008»

A hybrid approach to multi-agent decision-making

13 years 6 months ago

Download www.deetc.isel.ipl.pt

Abstract. In the aftermath of a large-scale disaster, agents’ decisions derive from self-interested (e.g. survival), common-good (e.g. victims’ rescue) and teamwork (e.g. ﬁre...

Paulo Trigo, Helder Coelho

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

13 years 11 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 2 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers