Search Sciweavers | Sciweavers

268 search results - page 31 / 54

» Solving multiagent assignment Markov decision processes

ATAL
2009
Springer

Intelligent Agents

Online exploration in least-squares policy iteration

16 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

STACS
1997
Springer

Theoretical Computer Science

Methods and Applications of (MAX, +) Linear Algebra

15 years 11 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus

ECML
2007
Springer

Machine Learning

Safe Q-Learning on Complete History Spaces

16 years 1 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

FLAIRS
2008

Artificial Intelligence

Planning for Welfare to Work

15 years 9 months ago

Download www.cs.uky.edu

We are interested in building decision-support software for social welfare case managers. Our model in the form of a factored Markov decision process is so complex that a standard...

Liangrong Yi, Raphael A. Finkel, Judy Goldsmith

AAAI
2010

Intelligent Agents

Compressing POMDPs Using Locality Preserving Non-Negative Matrix Factorization

15 years 9 months ago

Download www.cs.umass.edu

Partially Observable Markov Decision Processes (POMDPs) are a well-established and rigorous framework for sequential decision-making under uncertainty. POMDPs are well-known to be...

Georgios Theocharous, Sridhar Mahadevan
