Sciweavers

» Reinforcement Learning in MirrorBot
IJCAI
2001
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
ATAL
2008
Springer
Sequential decision making with untrustworthy service providers
In this paper, we deal with the sequential decision making problem of agents operating in computational economies, where there is uncertainty regarding the trustworthiness of serv...
W. T. Luke Teacy, Georgios Chalkiadakis, Alex Roge...
JAIR
2011
Non-Deterministic Policies in Markovian Decision Processes
Markovian processes have long been used to model stochastic environments. Reinforcement learning has emerged as a framework to solve sequential planning and decision-making proble...
Mahdi Milani Fard, Joelle Pineau
GECCO
2004
Springer
Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems
This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neural-network type learning and function approximation mechani...
Martin V. Butz, David E. Goldberg, Pier Luca Lanzi
ICML
1997
IEEE
Exponentiated Gradient Methods for Reinforcement Learning
Doina Precup, Richard S. Sutton