Search Sciweavers | Sciweavers

148 search results - page 10 / 30

» Reinforcement Learning for P2P Searching

click to vote

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

16 years 2 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

118

click to vote

ICML
1999
IEEE

138views Machine Learning» more ICML 1999»

Using Reinforcement Learning to Spider the Web Efficiently

16 years 2 months ago

Download www.cs.iastate.edu

Consider the task of exploring the Web in order to find pages of a particular kind or on a particular topic. This task arises in the construction of search engines and Web knowled...

Jason Rennie, Andrew McCallum

claim paper

Read More »

123

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

15 years 3 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

106

click to vote

ECML
2006
Springer

116views Machine Learning» more ECML 2006»

Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

15 years 5 months ago

Download web.engr.oregonstate.edu

Reinforcement learning in real-world domains suffers from three curses of dimensionality: explosions in state and action spaces, and high stochasticity. We present approaches that ...

Scott Proper, Prasad Tadepalli

claim paper

Read More »

141

Voted

GECCO
2009
Springer

162views Optimization» more GECCO 2009»

Uncertainty handling CMA-ES for reinforcement learning

14 years 11 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMAES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel

claim paper

Read More »

« Prev « First page 10 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers