Sciweavers

RAS
2006
13 years 4 months ago
From pixels to multi-robot decision-making: A study in uncertainty
Mobile robots must cope with uncertainty from many sources along the path from interpreting raw sensor inputs to behavior selection to execution of the resulting primitive actions...
Peter Stone, Mohan Sridharan, Daniel Stronger, Gre...
ICAC
2005
IEEE
13 years 10 months ago
Towards Autonomic Virtual Applications in the In-VIGO System
Grid environments enable users to share non-dedicated resources that lack performance guarantees. This paper describes the design of application-centric middleware components to a...
Jing Xu, Sumalatha Adabala, José A. B. Fort...
JMLR
2008
13 years 5 months ago
Finite-Time Bounds for Fitted Value Iteration
In this paper we develop a theoretical analysis of the performance of sampling-based fitted value iteration (FVI) to solve infinite state-space, discounted-reward Markovian decisi...
Rémi Munos, Csaba Szepesvári