Search Sciweavers | Sciweavers

201 search results - page 18 / 41

» Solving Concurrent Markov Decision Processes

212

click to vote

ATAL
2007
Springer

145views Intelligent Agents» more ATAL 2007»

Interactive dynamic influence diagrams

15 years 11 months ago

Download www.sci.brooklyn.cuny.edu

This paper extends the framework of dynamic influence diagrams (DIDs) to the multi-agent setting. DIDs are computational representations of the Partially Observable Markov Decisio...

Kyle Polich, Piotr J. Gmytrasiewicz

claim paper

Read More »

213

click to vote

IUI
2010
ACM

207views Software Engineering» more IUI 2010»

A POMDP approach to P300-based brain-computer interfaces

16 years 4 months ago

Download ailab.kaist.ac.kr

Most of the previous work on non-invasive brain-computer interfaces (BCIs) has been focused on feature extraction and classification algorithms to achieve high performance for the...

Jaeyoung Park, Kee-Eung Kim, Sungho Jo

claim paper

Read More »

223

click to vote

PRICAI
2000
Springer

193views Artificial Intelligence» more PRICAI 2000»

Generating Hierarchical Structure in Reinforcement Learning from State Variables

15 years 11 months ago

Download www.csee.umbc.edu

This paper presents the CQ algorithm which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...

Bernhard Hengst

claim paper

Read More »

207

click to vote

STACS
1997
Springer

137views Theoretical Computer Science» more STACS 1997»

Methods and Applications of (MAX, +) Linear Algebra

15 years 11 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus

claim paper

Read More »

196

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

16 years 2 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

« Prev « First page 18 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers