Search Sciweavers | Sciweavers

1176 search results - page 7 / 236

» Sparse reward processes

155

click to vote

ICIP
2003
IEEE

96views Image Processing» more ICIP 2003»

Nonlinear approximation based image recovery using adaptive sparse reconstructions

16 years 9 months ago

Download eeweb.poly.edu

Onur G. Guleryuz

claim paper

Read More »

168

Voted

CCIA
2009
Springer

101views Artificial Intelligence» more CCIA 2009»

Reward System for Completing FAQs

15 years 8 months ago

Download cgi2.cs.rpi.edu

The creation of Answer Communities around a FAQs Site is proposed to speed up the process of answering questions. Our approach combines long-term and short-term rewards. Long-term ...

Araceli Moreno, Josep Lluís de la Rosa, Bol...

claim paper

Read More »

266

click to vote

ARCS
2005
Springer

261views Software Engineering» more ARCS 2005»

Adaptive Object Acquisition

16 years 1 months ago

Download www.organic-computing.org

We propose an active vision system for object acquisition. The core of our approach is a reinforcement learning module which learns a strategy to scan an object. The agent moves a...

Gabriele Peters, Claus-Peter Alberts, Markus Bries...

claim paper

Read More »

243

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 2 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

225

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

16 years 2 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 7 / 236 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers