
238 search results - page 30 / 48
Search: Value-Function Approximations for Partially Observable Marko...
ACL
2010
Towards Relational POMDPs for Adaptive Dialogue Management
Open-ended spoken interactions are typically characterised by both structural complexity and high levels of uncertainty, making dialogue management in such settings a particularly...
Pierre Lison
CORR
2006
Springer
A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD
This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(λ), LSTD(λ)...
Manuel Loth, Philippe Preux
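For context on the setting this abstract refers to, the following is a minimal, generic sketch of TD(λ) policy evaluation with linear function approximation and eligibility traces, not the paper's own Full-Gradient or Equi-Gradient variants. The `env_step` and `phi` interfaces, the initial state, and all hyperparameters are illustrative assumptions.

```python
import numpy as np

def td_lambda(env_step, phi, n_features, n_episodes=100,
              alpha=0.05, gamma=0.99, lam=0.9):
    """Generic TD(lambda) policy evaluation with linear function approximation.

    env_step(state) -> (reward, next_state, done) samples one transition under
    the fixed policy being evaluated; phi(state) returns a feature vector of
    length n_features. Both are assumed, illustrative interfaces.
    """
    w = np.zeros(n_features)           # weights: V(s) is approximated by w . phi(s)
    for _ in range(n_episodes):
        s = 0                          # assumed initial state
        z = np.zeros(n_features)       # eligibility trace
        done = False
        while not done:
            r, s_next, done = env_step(s)
            v = w @ phi(s)
            v_next = 0.0 if done else w @ phi(s_next)
            delta = r + gamma * v_next - v      # TD error
            z = gamma * lam * z + phi(s)        # accumulate trace
            w += alpha * delta * z              # semi-gradient update
            s = s_next
    return w
```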
ATAL
2009
Springer
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
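As a rough illustration of the baseline this abstract builds on, here is a generic tabular Sarsa(λ) sketch that treats each observation as an estimated state; it is not the SarsaLandmark algorithm itself. The `env_reset`/`env_step` interfaces and all hyperparameters are assumptions for the sketch.

```python
import numpy as np

def sarsa_lambda(n_obs, n_actions, env_reset, env_step, episodes=500,
                 alpha=0.1, gamma=0.95, lam=0.9, epsilon=0.1, rng=None):
    """Tabular Sarsa(lambda) over observations (a memoryless baseline).

    env_reset() -> obs and env_step(action) -> (obs, reward, done) are
    assumed, illustrative interfaces.
    """
    rng = rng or np.random.default_rng(0)
    Q = np.zeros((n_obs, n_actions))

    def eps_greedy(o):
        # Explore with probability epsilon, otherwise act greedily on Q.
        if rng.random() < epsilon:
            return int(rng.integers(n_actions))
        return int(np.argmax(Q[o]))

    for _ in range(episodes):
        E = np.zeros_like(Q)           # eligibility traces
        o = env_reset()
        a = eps_greedy(o)
        done = False
        while not done:
            o2, r, done = env_step(a)
            a2 = eps_greedy(o2)
            target = r if done else r + gamma * Q[o2, a2]
            delta = target - Q[o, a]
            E[o, a] += 1.0             # accumulating trace
            Q += alpha * delta * E     # update all traced pairs
            E *= gamma * lam           # decay traces
            o, a = o2, a2
    return Q
```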
AAAI
2004
Stochastic Local Search for POMDP Controllers
The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...
Darius Braziunas, Craig Boutilier
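To make the controller-search setting concrete, the following is a toy sketch of stochastic local search over the parameters of a finite-state controller, not the specific algorithm proposed in this paper. The controller parameterisation, the black-box `evaluate` function (e.g. Monte Carlo rollouts in a POMDP simulator), and the acceptance rule are all assumptions.

```python
import numpy as np

def local_search_fsc(n_nodes, n_actions, n_obs, evaluate,
                     iters=200, noise=0.1, rng=None):
    """Toy stochastic local search over a finite-state controller (FSC).

    The FSC is parameterised by an action distribution per node and a
    node-transition distribution per (node, observation). evaluate(fsc) is an
    assumed black box returning an estimate of expected return.
    """
    rng = rng or np.random.default_rng(0)

    def softmax(x, axis=-1):
        e = np.exp(x - x.max(axis=axis, keepdims=True))
        return e / e.sum(axis=axis, keepdims=True)

    # Unconstrained logits; softmax turns them into valid distributions.
    act_logits = rng.normal(size=(n_nodes, n_actions))
    trans_logits = rng.normal(size=(n_nodes, n_obs, n_nodes))

    def make_fsc(a_log, t_log):
        return softmax(a_log), softmax(t_log)

    best = evaluate(make_fsc(act_logits, trans_logits))
    for _ in range(iters):
        # Propose a random perturbation of the controller parameters.
        a_new = act_logits + noise * rng.normal(size=act_logits.shape)
        t_new = trans_logits + noise * rng.normal(size=trans_logits.shape)
        value = evaluate(make_fsc(a_new, t_new))
        if value > best:               # greedily accept improving moves
            act_logits, trans_logits, best = a_new, t_new, value
    return make_fsc(act_logits, trans_logits), best
```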
HICSS
2003
IEEE
Formalizing Multi-Agent POMDP's in the context of network routing
This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for Multi-Agent planning. We distinguish three perspectives: the first is that of a...
Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...