Search Sciweavers | Sciweavers

7006 search results - page 451 / 1402

» Approximation Algorithms

ICML
2003
IEEE

146 views, Machine Learning

TD(0) Converges Provably Faster than the Residual Gradient Algorithm

16 years 5 months ago

Download www.hpl.hp.com

In Reinforcement Learning (RL) there has been some experimental evidence that the residual gradient algorithm converges slower than the TD(0) algorithm. In this paper, we use the ...

Ralf Schoknecht, Artur Merke

ICML
2008
IEEE

157 views, Machine Learning

Efficiently learning linear-linear exponential family predictive representations of state

16 years 5 months ago

Download web.mit.edu

Exponential Family PSR (EFPSR) models capture stochastic dynamical systems by representing state as the parameters of an exponential family distribution over a short-term window of...

David Wingate, Satinder P. Singh

PKDD
2009
Springer

144 views, Data Mining

Compositional Models for Reinforcement Learning

15 years 11 months ago

Download userweb.cs.utexas.edu

Abstract. Innovations such as optimistic exploration, function approximation, and hierarchical decomposition have helped scale reinforcement learning to more complex environments, ...

Nicholas K. Jong, Peter Stone

ICCS
2007
Springer

129 views, Applied Computing

Complexity of Monte Carlo Algorithms for a Class of Integral Equations

15 years 11 months ago

Download parallel.bas.bg

In this work we study the computational complexity of a class of grid Monte Carlo algorithms for integral equations. The idea of the algorithms consists in an approximation of the ...

Ivan Dimov, Rayna Georgieva

MP
2006

137 views

New algorithms for singly linearly constrained quadratic programs subject to lower and upper bounds

15 years 5 months ago

Download www.soe.ucsc.edu

There are many applications related to singly linearly constrained quadratic programs subjected to upper and lower bounds. In this paper, a new algorithm based on secant approximat...

Yu-Hong Dai, Roger Fletcher
