Search Sciweavers | Sciweavers

29 search results - page 4 / 6

» Linear Program Approximations for Factored Continuous-State ...

ICML
2006
IEEE

256 views, Machine Learning

Automatic basis function construction for approximate dynamic programming and reinforcement learning

13 years 11 months ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

AIPS
2009

144 views, Artificial Intelligence

Efficient Solutions to Factored MDPs with Imprecise Transition Probabilities

13 years 6 months ago

Download www.ime.usp.br

When modeling real-world decision-theoretic planning problems in the Markov decision process (MDP) framework, it is often impossible to obtain a completely accurate estimate of tr...

Karina Valdivia Delgado, Scott Sanner, Leliane Nun...

AAAI
2012

205 views, Intelligent Agents

Planning in Factored Action Spaces with Symbolic Dynamic Programming

11 years 8 months ago

Download www.cs.tufts.edu

We consider symbolic dynamic programming (SDP) for solving Markov Decision Processes (MDP) with factored state and action spaces, where both states and actions are described by se...

Aswin Raghavan, Saket Joshi, Alan Fern, Prasad Tad...

ATAL
2010
Springer

128 views, Intelligent Agents

Approximate dynamic programming with affine ADDs

13 years 23 days ago

Download eprints.pascal-network.org

The Affine ADD (AADD) is an extension of the Algebraic Decision Diagram (ADD) that compactly represents context-specific, additive and multiplicative structure in functions from a...

Scott Sanner, William T. B. Uther, Karina Valdivia...

NIPS
2007

146 views, Information Technology

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

13 years 7 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett
