Sciweavers

1235 search results - page 146 / 247
» ABC Reinforcement Learning
EUSFLAT
2009
14 years 10 months ago
Incremental Possibilistic Approach for Online Clustering and Classification
In this paper, we propose extending the supervised classification method Fuzzy Pattern Matching so that it also works as an unsupervised one. The goal is to monitor dynamic systems with...
Moamar Sayed Mouchaweh, Bernard Riera
ICML
2005
IEEE
16 years 1 month ago
Learning to compete, compromise, and cooperate in repeated general-sum games
Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...
Jacob W. Crandall, Michael A. Goodrich
AAAI
2007
15 years 3 months ago
Active Imitation Learning
Imitation learning, also called learning by watching or programming by demonstration, has emerged as a means of accelerating many reinforcement learning tasks. Previous work has s...
Aaron P. Shon, Deepak Verma, Rajesh P. N. Rao
ICML
2001
IEEE
16 years 1 month ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
ATAL
2004
Springer
15 years 6 months ago
Best-Response Multiagent Learning in Non-Stationary Environments
This paper investigates a relatively new direction in Multiagent Reinforcement Learning. Most multiagent learning techniques focus on Nash equilibria as elements of both the learn...
Michael Weinberg, Jeffrey S. Rosenschein