Search Sciweavers | Sciweavers

1233 search results - page 163 / 247

» Reinforcement learning

ATAL
2008
Springer

104 views, Intelligent Agents

Expediting RL by using graphical structures

15 years 5 months ago

Download www.cs.washington.edu

The goal of reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 1 month ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease of implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

AIIDE
2008

146 views, Artificial Intelligence

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

15 years 6 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

DIGITEL
2008
IEEE

236 views, Artificial Intelligence

Adaptive Educational Games: Providing Non-invasive Personalised Learning Experiences

15 years 5 months ago

Download www.mendeley.com

Educational games have the potential to provide intrinsically motivating learning experiences that immerse and engage the learner. However, the much heralded benefits of education...

Neil Peirce, Owen Conlan, Vincent Wade

ACMICEC
2007
ACM

102 views, ECommerce

Learning to trade with insider information

15 years 7 months ago

Download www.cs.rpi.edu

This paper introduces algorithms for learning how to trade using insider (superior) information in Kyle's model of financial markets. Prior results in finance theory relied o...

Sanmay Das
