Sciweavers

1176 search results - page 7 / 236
» Sparse reward processes
Sort
View
CCIA
2009
Springer
15 years 24 days ago
Reward System for Completing FAQs
The creation of Answer Communities around a FAQs Site is proposed to speed up the process of answering questions. Our approach combines long-term and short-term rewards. Long-term ...
Araceli Moreno, Josep Lluís de la Rosa, Bol...
ARCS
2005
Springer
15 years 5 months ago
Adaptive Object Acquisition
We propose an active vision system for object acquisition. The core of our approach is a reinforcement learning module which learns a strategy to scan an object. The agent moves a...
Gabriele Peters, Claus-Peter Alberts, Markus Bries...
JMLR
2010
189views more  JMLR 2010»
14 years 6 months ago
Adaptive Step-size Policy Gradients with Average Reward Metric
In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...
Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...
114
Voted
FOCS
2007
IEEE
15 years 6 months ago
Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards
We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...
Sudipto Guha, Kamesh Munagala