Search Sciweavers | Sciweavers

238 search results - page 23 / 48

» Value-Function Approximations for Partially Observable Marko...

151

click to vote

UAI
2008

230views Artificial Intelligence» more UAI 2008»

Partitioned Linear Programming Approximations for MDPs

15 years 6 months ago

Download uai2008.cs.helsinki.fi

Approximate linear programming (ALP) is an efficient approach to solving large factored Markov decision processes (MDPs). The main idea of the method is to approximate the optimal...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

112

click to vote

NIPS
2004

128views Information Technology» more NIPS 2004»

A Cost-Shaping LP for Bellman Error Minimization with Performance Guarantees

15 years 6 months ago

Download books.nips.cc

We introduce a new algorithm based on linear programming that approximates the differential value function of an average-cost Markov decision process via a linear combination of p...

Daniela Pucci de Farias, Benjamin Van Roy

claim paper

Read More »

153

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 6 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

160

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 6 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

133

click to vote

AAAI
2006

142views Intelligent Agents» more AAAI 2006»

Learning Basis Functions in Hybrid Domains

15 years 6 months ago

Download www.aaai.org

Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...

Branislav Kveton, Milos Hauskrecht

claim paper

Read More »

« Prev « First page 23 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers