Search Sciweavers | Sciweavers

58 search results - page 7 / 12

» Fuzzy Approximation for Convergent Model-Based Reinforcement...

242

click to vote

IWLCS
2005
Springer

161views Machine Learning» more IWLCS 2005»

Counter Example for Q-Bucket-Brigade Under Prediction Problem

16 years 1 months ago

Download www.cs.bham.ac.uk

Aiming to clarify the convergence or divergence conditions for Learning Classiﬁer System (LCS), this paper explores: (1) an extreme condition where the reinforcement process of ...

Atsushi Wada, Keiki Takadama, Katsunori Shimohara

claim paper

Read More »

207

click to vote

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

16 years 28 days ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

185

click to vote

ACMICEC
2007
ACM

102views ECommerce» more ACMICEC 2007»

Learning to trade with insider information

15 years 11 months ago

Download www.cs.rpi.edu

This paper introduces algorithms for learning how to trade using insider (superior) information in Kyle's model of financial markets. Prior results in finance theory relied o...

Sanmay Das

claim paper

Read More »

210

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 10 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

200

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 8 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

« Prev « First page 7 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers