Sciweavers

288 search results - page 32 / 58
» Risk-averse dynamic programming for Markov decision processe...
Sort
View
NIPS
2007
15 years 1 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
91
Voted
ICPR
2006
IEEE
16 years 26 days ago
Learning Policies for Efficiently Identifying Objects of Many Classes
Viola and Jones (VJ) cascade classification methods have proven to be very successful in detecting objects belonging to a single class -- e.g., faces. This paper addresses the mor...
Ahmed M. Elgammal, Ramana Isukapalli, Russell Grei...
ICML
2004
IEEE
16 years 17 days ago
Bellman goes relational
Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...
Kristian Kersting, Martijn Van Otterlo, Luc De Rae...
ICC
2007
IEEE
15 years 6 months ago
Dynamic Lightpath Establishment for Service Differentiation Based on Optimal MDP Policy in All-Optical Networks with Wavelength
— In this paper, we propose a dynamic lightpath establishment method for service differentiation in all-optical WDM networks with the capability of full-range wavelength conversi...
Takuji Tachibana, Shoji Kasahara, Kenji Sugimoto
IFIP
2009
Springer
15 years 6 months ago
HMM-Based Trust Model
Probabilistic trust has been adopted as an approach to taking security sensitive decisions in modern global computing environments. Existing probabilistic trust frameworks either a...
Ehab ElSalamouny, Vladimiro Sassone, Mogens Nielse...