Search Sciweavers | Sciweavers

1325 search results - page 41 / 265

» Algorithm Selection using Reinforcement Learning

166

click to vote

GLOBECOM
2010
IEEE

132views Communications» more GLOBECOM 2010»

Reinforcement Learning for Link Adaptation in MIMO-OFDM Wireless Systems

15 years 3 months ago

Download users.ece.utexas.edu

Machine learning algorithms have recently attracted much interest for effective link adaptation due to their flexibility and ability to capture more environmental effects implicitl...

Sungho Yun, Constantine Caramanis

claim paper

Read More »

172

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

16 years 3 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

238

click to vote

NIPS
2008

149views Information Technology» more NIPS 2008»

Optimization on a Budget: A Reinforcement Learning Approach

15 years 7 months ago

Download www.cs.arizona.edu

Many popular optimization algorithms, like the Levenberg-Marquardt algorithm (LMA), use heuristic-based "controllers" that modulate the behavior of the optimizer during ...

Paul Ruvolo, Ian R. Fasel, Javier R. Movellan

claim paper

Read More »

161

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 4 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

149

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 7 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

« Prev « First page 41 / 265 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers