Sciweavers

1138 search results - page 105 / 228
» Feature Markov Decision Processes
JMLR
2010
14 years 7 months ago
Variational methods for Reinforcement Learning
We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...
Thomas Furmston, David Barber
ROOM
2000
15 years 2 months ago
OO-Motivated Process Algebra: A Calculus for CORBA-like Systems
This paper is a proposal for a new two-tier calculus, designed to model aspects of CORBA-like systems at the CORBA level. The higher object level known as Oompa abstracts away from...
Malcolm Tyrrell, Andrew Butterfield, Alexis Donnel...
TITB
2010
14 years 7 months ago
Sleep staging based on signals acquired through bed sensor
We describe a system for the evaluation of the sleep macrostructure on the basis of Emfit sensor foils placed into a bed mattress and of advanced signal processing. The signals on wh...
Juha M. Kortelainen, Martin O. Mendez, Anna M. Bia...
AIPS
2008
15 years 3 months ago
HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot
Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is ...
Mohan Sridharan, Jeremy L. Wyatt, Richard Dearden
FOCS
2007
IEEE
15 years 7 months ago
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards
We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...
Sudipto Guha, Kamesh Munagala