Search Sciweavers | Sciweavers

38 search results - page 7 / 8

» Improving reinforcement learning function approximators via ...

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

13 years 4 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

click to vote

ICAC
2006
IEEE

112views Applied Computing» more ICAC 2006»

A Hybrid Reinforcement Learning Approach to Autonomic Resource Allocation

13 years 11 months ago

Download userweb.cs.utexas.edu

— Reinforcement Learning (RL) provides a promising new approach to systems performance management that differs radically from standard queuing-theoretic approaches making use of ...

Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mo...

claim paper

Read More »

click to vote

GECCO
2009
Springer

82views Optimization» more GECCO 2009»

On the scalability of XCS(F)

13 years 11 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Many successful applications have proven the potential of Learning Classiﬁer Systems and the XCS classiﬁer system in particular in datamining, reinforcement learning, and func...

Patrick O. Stalph, Martin V. Butz, David E. Goldbe...

claim paper

Read More »

click to vote

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

13 years 10 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

click to vote

ICMLA
2009

162views Machine Learning» more ICMLA 2009»

Transformation Learning Via Kernel Alignment

13 years 2 months ago

Download www.cs.columbia.edu

This article proposes an algorithm to automatically learn useful transformations of data to improve accuracy in supervised classification tasks. These transformations take the for...

Andrew Howard, Tony Jebara

claim paper

Read More »

« Prev « First page 7 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers