Search Sciweavers | Sciweavers

201 search results - page 19 / 41

» Solving Concurrent Markov Decision Processes

ECML
2007
Springer

108 views, Machine Learning

Safe Q-Learning on Complete History Spaces

15 years 8 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

FLAIRS
2008

65 views, Artificial Intelligence

Planning for Welfare to Work

15 years 4 months ago

Download www.cs.uky.edu

We are interested in building decision-support software for social welfare case managers. Our model in the form of a factored Markov decision process is so complex that a standard...

Liangrong Yi, Raphael A. Finkel, Judy Goldsmith

AAAI
2010

201 views, Intelligent Agents

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization

15 years 3 months ago

Download www.cs.umass.edu

Partially Observable Markov Decision Processes (POMDPs) are a well-established and rigorous framework for sequential decision-making under uncertainty. POMDPs are well-known to be...

Georgios Theocharous, Sridhar Mahadevan

IJCAI
2001

185 views, Artificial Intelligence

Symbolic Dynamic Programming for First-Order MDPs

15 years 3 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

NIPS
2004

224 views, Information Technology

Approximately Efficient Online Mechanism Design

15 years 3 months ago

Download www.cs.cmu.edu

Online mechanism design (OMD) addresses the problem of sequential decision making in a stochastic environment with multiple self-interested agents. The goal in OMD is to make valu...

David C. Parkes, Satinder P. Singh, Dimah Yanovsky
