Search Sciweavers | Sciweavers

65 search results - page 1 / 13

» Gradient Descent for General Reinforcement Learning

click to vote

NIPS
1998

140views Information Technology» more NIPS 1998»

Gradient Descent for General Reinforcement Learning

13 years 6 months ago

Download www.ri.cmu.edu

A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide range of new reinforcementlearning algorithms. These algorithms solve a number ...

Leemon C. Baird III, Andrew W. Moore

claim paper

Read More »

click to vote

ALIFE
2002

176views Modeling And Simulation» more ALIFE 2002»

Ant Colony Optimization and Stochastic Gradient Descent

13 years 4 months ago

Download ti.arc.nasa.gov

In this paper, we study the relationship between the two techniques known as ant colony optimization (aco) and stochastic gradient descent. More precisely, we show that some empir...

Nicolas Meuleau, Marco Dorigo

claim paper

Read More »

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

14 years 5 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

click to vote

GECCO
2007
Springer

168views Optimization» more GECCO 2007»

Empirical analysis of generalization and learning in XCS with gradient descent

13 years 11 months ago

Download www.psychologie.uni-wuerzburg.de

We analyze generalization and learning in XCS with gradient descent. At ﬁrst, we show that the addition of gradient in XCS may slow down learning because it indirectly decreases...

Pier Luca Lanzi, Martin V. Butz, David E. Goldberg

claim paper

Read More »

click to vote

CORR
2004
Springer

103views Education» more CORR 2004»

Online convex optimization in the bandit setting: gradient descent without a gradient

13 years 4 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, . . . , and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

claim paper

Read More »

« Prev « First page 1 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers