Sciweavers

168 search results - page 6 / 34
» Optimism in Reinforcement Learning Based on Kullback-Leibler...
Sort
View
WSC
2007
14 years 11 months ago
Optimizing time warp simulation with reinforcement learning techniques
Adaptive Time Warp protocols in the literature are usually based on a pre-defined analytic model of the system, expressed as a closed form function that maps system state to cont...
Jun Wang, Carl Tropper
ICML
1996
IEEE
15 years 1 months ago
A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...
Rémi Munos
ECML
2005
Springer
15 years 3 months ago
Active Learning for Probability Estimation Using Jensen-Shannon Divergence
Active selection of good training examples is an important approach to reducing data-collection costs in machine learning; however, most existing methods focus on maximizing classi...
Prem Melville, Stewart M. Yang, Maytal Saar-Tsecha...
GECCO
2004
Springer
122views Optimization» more  GECCO 2004»
15 years 2 months ago
Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems
This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neuralnetwork type learning and function approximation mechani...
Martin V. Butz, David E. Goldberg, Pier Luca Lanzi
ICML
1997
IEEE
15 years 10 months ago
Hierarchical Explanation-Based Reinforcement Learning
Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...
Prasad Tadepalli, Thomas G. Dietterich