Sciweavers

1176 search results - page 7 / 236
» Sparse reward processes
Sort
View
CCIA
2009
Springer
15 years 6 months ago
Reward System for Completing FAQs
The creation of Answer Communities around a FAQs Site is proposed to speed up the process of answering questions. Our approach combines long-term and short-term rewards. Long-term ...
Araceli Moreno, Josep Lluís de la Rosa, Bol...
ARCS
2005
Springer
15 years 11 months ago
Adaptive Object Acquisition
We propose an active vision system for object acquisition. The core of our approach is a reinforcement learning module which learns a strategy to scan an object. The agent moves a...
Gabriele Peters, Claus-Peter Alberts, Markus Bries...
JMLR
2010
189views more  JMLR 2010»
15 years 11 days ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
179
Voted
FOCS
2007
IEEE
15 years 12 months ago
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards
We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...
Sudipto Guha, Kamesh Munagala