Search Sciweavers | Sciweavers

87 search results - page 4 / 18

» Dynamic Programming for Partially Observable Stochastic Game...

192

click to vote

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

15 years 8 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

270

click to vote

PAMI
2007

186views more PAMI 2007»

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

15 years 6 months ago

Download people.ee.duke.edu

—This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little

claim paper

Read More »

201

click to vote

AAAI
1996

197views Intelligent Agents» more AAAI 1996»

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

15 years 8 months ago

Download people.cs.ubc.ca

: Partially-observable Markov decision processes provide a very general model for decision-theoretic planning problems, allowing the trade-offs between various courses of actions t...

Craig Boutilier, David Poole

claim paper

Read More »

176

click to vote

ICCBR
2010
Springer

261views Automated Reasoning» more ICCBR 2010»

Imitating Inscrutable Enemies: Learning from Stochastic Policy Observation, Retrieval and Reuse

15 years 11 months ago

Download www.cse.lehigh.edu

In this paper we study the topic of CBR systems learning from observations in which those observations can be represented as stochastic policies. We describe a general framework wh...

Kellen Gillespie, Justin Karneeb, Stephen Lee-Urba...

claim paper

Read More »

215

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 9 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

« Prev « First page 4 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers