Sciweavers

2566 search results - page 5 / 514
» Relating reinforcement learning performance to classificatio...
Sort
View
ISCA
2008
IEEE
137views Hardware» more  ISCA 2008»
15 years 6 months ago
Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
Efficiently utilizing off-chip DRAM bandwidth is a critical issue in designing cost-effective, high-performance chip multiprocessors (CMPs). Conventional memory controllers deli...
Engin Ipek, Onur Mutlu, José F. Martí...
89
Voted
IJCAI
2003
15 years 1 months ago
Covariant Policy Search
We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...
J. Andrew Bagnell, Jeff G. Schneider
CORR
1998
Springer
164views Education» more  CORR 1998»
14 years 11 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
AUSAI
2005
Springer
15 years 5 months ago
Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning
: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...
Peter Vamplew, Robert Ollington
ICTAI
2007
IEEE
15 years 6 months ago
Multi-agent Reinforcement Learning Using Strategies and Voting
Multiagent learning attracts much attention in the past few years as it poses very challenging problems. Reinforcement Learning is an appealing solution to the problems that arise...
Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...