Search Sciweavers | Sciweavers

97

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 25 days ago

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

119

click to vote

CEC
2011
IEEE

221views Artificial Intelligence» more CEC 2011»

Stochastic Natural Gradient Descent by estimation of empirical covariances

13 years 11 months ago

Download chrome.ws.dei.polimi.it

—Stochastic relaxation aims at ﬁnding the minimum of a ﬁtness function by identifying a proper sequence of distributions, in a given model, that minimize the expected value o...

Luigi Malagò, Matteo Matteucci, Giovanni Pi...

claim paper

Read More »

92

click to vote

ESANN
2007

148views Neural Networks» more ESANN 2007»

Applying the Episodic Natural Actor-Critic Architecture to Motor Primitive Learning

15 years 1 months ago

Download www.dice.ucl.ac.be

In this paper, we investigate motor primitive learning with the Natural Actor-Critic approach. The Natural Actor-Critic consists out of actor updates which are achieved using natur...

Jan Peters, Stefan Schaal

claim paper

Read More »

104

click to vote

NCI
2004

185views Neural Networks» more NCI 2004»

Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

15 years 1 months ago

Download staff.science.uva.nl

This paper describes a method for hierarchical reinforcement learning in which high-level policies automatically discover subgoals, and low-level policies learn to specialize for ...

Bram Bakker, Jürgen Schmidhuber

claim paper

Read More »

75

click to vote

PCI
2005
Springer

114views Information Technology» more PCI 2005»

TSIC: Thermal Scheduling Simulator for Chip Multiprocessors

15 years 5 months ago

Download www2.cs.ucy.ac.cy

Abstract. Increased power density, hot-spots, and temperature gradients are severe limiting factors for today’s state-of-the-art microprocessors. However, the ﬂexibility oﬀer...

Kyriakos Stavrou, Pedro Trancoso

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers