Search Sciweavers | Sciweavers

3038 search results - page 32 / 608

» High-Speed Function Approximation

click to vote

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

14 years 11 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

101

click to vote

JAIR
2008

126views more JAIR 2008»

Optimal and Approximate Q-value Functions for Decentralized POMDPs

14 years 9 months ago

Download www.jair.org

Decision-theoretic planning is a popular approach to sequential decision making problems, because it treats uncertainty in sensing and acting in a principled way. In single-agent ...

Frans A. Oliehoek, Matthijs T. J. Spaan, Nikos A. ...

claim paper

Read More »

click to vote

NIPS
1994

178views Information Technology» more NIPS 1994»

Generalization in Reinforcement Learning: Safely Approximating the Value Function

14 years 11 months ago

Download www.ri.cmu.edu

To appear in: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge MA, 1995. A straightforward approach to t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

14 years 10 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

15 years 10 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

« Prev « First page 32 / 608 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers