Sciweavers

13 search results - page 2 / 3
» Minimal Sufficient Explanations for Factored Markov Decision...
PKDD
2010
Springer
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
ICTAI
2010
IEEE
A Closer Look at MOMDPs
Abstract--The difficulties encountered in sequential decision-making problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of th...
Mauricio Araya-López, Vincent Thomas, Olivi...
IUI
2010
ACM
A POMDP approach to P300-based brain-computer interfaces
Most of the previous work on non-invasive brain-computer interfaces (BCIs) has been focused on feature extraction and classification algorithms to achieve high performance for the...
Jaeyoung Park, Kee-Eung Kim, Sungho Jo
NIPS
2000
Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task
The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
Brian Sallans, Geoffrey E. Hinton
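The Sallans and Hinton abstract above describes approximating the Q-value of a state-action pair by the (negative) free energy of an energy-based model. A minimal sketch of that idea, assuming a restricted Boltzmann machine with binary hidden units over a concatenated state-action vector; the function name, toy sizes, and random weights here are all hypothetical illustrations, not the paper's actual model:

```python
import numpy as np

def rbm_negative_free_energy(x, W, b_hid, b_vis):
    """Negative free energy of an RBM with binary hidden units:
    -F(x) = b_vis . x + sum_k log(1 + exp(b_hid_k + W_k . x)).
    Used here as a Q-value approximation: Q(s, a) ~ -F([s; a])."""
    pre_activation = b_hid + W @ x            # hidden-unit inputs
    softplus = np.logaddexp(0.0, pre_activation)  # log(1 + exp(.)), stable
    return b_vis @ x + np.sum(softplus)

# Hypothetical toy sizes: 4 state bits + 2 action bits, 3 hidden units.
rng = np.random.default_rng(0)
n_vis, n_hid = 6, 3
W = 0.1 * rng.normal(size=(n_hid, n_vis))
b_hid = np.zeros(n_hid)
b_vis = np.zeros(n_vis)

x = np.array([1, 0, 1, 0, 0, 1], dtype=float)  # concatenated (state, action)
q_value = rbm_negative_free_energy(x, W, b_hid, b_vis)
```

Because the free energy is a smooth function of shared weights, Q-values generalize across the factored state-action space instead of being stored per entry.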
CORR
2006
Springer
A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...
Manuel Loth, Philippe Preux
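The Loth and Preux abstract above concerns policy evaluation with linear function approximation, unifying TD(λ)-style algorithms. A minimal sketch of the standard TD(λ) update with accumulating eligibility traces, for context; the function name, the toy chain, and the step-size settings are hypothetical, and this is plain TD(λ), not the paper's Full-Gradient or Equi-Gradient variants:

```python
import numpy as np

def td_lambda_linear(features, rewards, alpha=0.1, gamma=0.9, lam=0.8):
    """TD(lambda) policy evaluation with a linear value function
    V(s) = theta . phi(s) and accumulating eligibility traces.
    `features` holds feature vectors phi(s_t) along one trajectory;
    `rewards` holds the rewards observed between successive states."""
    theta = np.zeros_like(features[0])
    z = np.zeros_like(theta)  # eligibility trace
    for t in range(len(rewards)):
        phi, phi_next = features[t], features[t + 1]
        # TD error: one-step bootstrapped target minus current estimate.
        delta = rewards[t] + gamma * (theta @ phi_next) - theta @ phi
        z = gamma * lam * z + phi          # decay trace, credit current state
        theta = theta + alpha * delta * z  # update along the trace
    return theta

# Hypothetical toy chain: three tabular states, one reward at the end.
phi = np.eye(3)
features = [phi[0], phi[1], phi[2], phi[2]]
rewards = [0.0, 0.0, 1.0]
theta = td_lambda_linear(features, rewards)
```

With one-hot (tabular) features this reduces to ordinary tabular TD(λ); the same code runs unchanged with arbitrary feature vectors.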