Sciweavers

1233 search results - page 163 / 247
» Reinforcement learning
Sort
View
ATAL
2008
Springer
15 years 5 months ago
Expediting RL by using graphical structures
The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...
Peng Dai, Alexander L. Strehl, Judy Goldsmith
ICML
2010
IEEE
15 years 1 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
AIIDE
2008
15 years 6 months ago
Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games
We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...
Maria Cutumisu, Duane Szafron, Michael H. Bowling,...
DIGITEL
2008
IEEE
15 years 5 months ago
Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences
Educational games have the potential to provide intrinsically motivating learning experiences that immerse and engage the learner. However, the much heralded benefits of education...
Neil Peirce, Owen Conlan, Vincent Wade
ACMICEC
2007
ACM
102views ECommerce» more  ACMICEC 2007»
15 years 7 months ago
Learning to trade with insider information
This paper introduces algorithms for learning how to trade using insider (superior) information in Kyle's model of financial markets. Prior results in finance theory relied o...
Sanmay Das