Sciweavers


AAAI
2006


Learning Basis Functions in Hybrid Domains



Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea of the approach is to approximate the optimal value function by a set of basis functions and optimize their weights by linear programming. The quality of this approximation naturally depends on its basis functions. However, basis functions leading to good approximations are rarely known in advance. In this paper, we propose a new approach that discovers these functions automatically. The method relies on a class of parametric basis function models, which are optimized using the dual formulation of a relaxed HALP. We demonstrate the performance of our method on two hybrid optimization problems and compare it to manually selected basis functions.
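
To make the main idea concrete, the approximate linear programming formulation that the abstract describes can be sketched as follows. This is the generic textbook form, not necessarily the exact program used in the paper; all symbols are assumptions introduced here for illustration: states x, actions a, basis functions f_i, weights w_i, reward R, transition model P, discount factor γ, and state relevance weights ψ.

```latex
% Assumed notation (not from the abstract):
%   x, a : state and action      f_i, w_i : basis functions and their weights
%   R    : reward model          P        : transition model
%   \gamma : discount factor     \psi     : state relevance weights

% Linear value function approximation
V^{w}(x) = \sum_{i} w_i f_i(x)

% Approximate linear program over the weights w
\begin{aligned}
\min_{w} \quad & \sum_{i} w_i \, \mathbb{E}_{\psi}\big[ f_i(x) \big] \\
\text{s.t.} \quad & \sum_{i} w_i \Big( f_i(x) - \gamma \, \mathbb{E}_{P(x' \mid x, a)}\big[ f_i(x') \big] \Big) \ge R(x, a)
\qquad \forall\, x, a
\end{aligned}
```

In a hybrid domain the expectations combine sums over the discrete components with integrals over the continuous ones, and there is one constraint per state-action pair, so the constraint set is infinite. A relaxation could, for example, enforce only a finite subset of these constraints; whether that matches the relaxed HALP used by the authors is an assumption here.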

Branislav Kveton, Milos Hauskrecht


AAAI 2006 | Basis Functions | Hybrid Approximate Linear | Intelligent Agents | Linear Programming


Related Content

» Basis function construction for hierarchical reinforcement learning

» Region-Based Image Retrieval using Radial Basis Function Network

» Learning state-action basis functions for hierarchical MDPs

» Normalized Gaussian Radial Basis Function networks

» Graph Laplacian based transfer learning in reinforcement learning

» Creating an empirical basis for adaptation decisions

» Learning Novel Domains Through Curiosity and Conjecture

» Learning Functional Object-Categories from a Relational Spatio-Temporal Representation

» A Hybrid Approach for Learning Parameters of Probabilistic Networks from Incomplete Databa...

Post Info

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	AAAI
Authors	Branislav Kveton, Milos Hauskrecht
