Search Sciweavers | Sciweavers

1138 search results - page 105 / 228

» Feature Markov Decision Processes

138

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

15 years 9 days ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

170

click to vote

ROOM
2000

149views Programming Languages» more ROOM 2000»

OO-Motivated Process Algebra: A Calculus for CORBA-like Systems

15 years 6 months ago

Download www.bcs.org

This paper is a proposal for a new two-tier calculus, designed to model aspects of CORBA-like systems at the CORBA evel. The higher object level known as Oompa abstracts away from...

Malcolm Tyrrell, Andrew Butterfield, Alexis Donnel...

claim paper

Read More »

141

click to vote

TITB
2010

95views Education» more TITB 2010»

Sleep staging based on signals acquired through bed sensor

15 years 7 days ago

Download heartcycle.med.auth.gr

We describe a system for the evaluation of the sleep macrostructure on the basis of Emfit sensor foils placed into bed mattress and of advanced signal processing. The signals on wh...

Juha M. Kortelainen, Martin O. Mendez, Anna M. Bia...

claim paper

Read More »

150

click to vote

AIPS
2008

155views Artificial Intelligence» more AIPS 2008»

HiPPo: Hierarchical POMDPs for Planning Information Processing and Sensing Actions on a Robot

15 years 7 months ago

Download www.cs.bham.ac.uk

Flexible general purpose robots need to tailor their visual processing to their task, on the fly. We propose a new approach to this within a planning framework, where the goal is ...

Mohan Sridharan, Jeremy L. Wyatt, Richard Dearden

claim paper

Read More »

176

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

15 years 12 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 105 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers