Search Sciweavers | Sciweavers

9 search results - page 1 / 2

» A gradient-based reinforcement learning approach to dynamic ...

UAI
2001

129 views, Artificial Intelligence

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

13 years 6 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao

FGCS
2008

68 views

A gradient-based reinforcement learning approach to dynamic pricing in partially-observable environments

13 years 4 months ago

Download labs.oracle.com

David Vengerov

IROS
2009
IEEE

206 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

13 years 11 months ago

Download www.cs.cmu.edu

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

WECWIS
2003
IEEE

120 views, ECommerce

Reinforcement Learning Applications in Dynamic Pricing of Retail Markets

13 years 9 months ago

Download lcm.csa.iisc.ernet.in

In this paper, we investigate the use of reinforcement learning (RL) techniques for the problem of determining dynamic prices in an electronic retail market. As representative mode...

C. V. L. Raju, Y. Narahari, K. Ravikumar

CSE
2008
IEEE

172 views, Theoretical Computer Science

Adaptation to Dynamic Resource Availability in Ad Hoc Grids through a Learning Mechanism

13 years 11 months ago

Download ce.et.tudelft.nl

Ad-hoc Grids are highly heterogeneous and dynamic networks; one of the main challenges of resource allocation in such environments is to find mechanisms which do not rely on the ...

Behnaz Pourebrahimi, Koen Bertels
