Sciweavers

56 search results - page 11 / 12
» Reinforcement Learning for Average Reward Zero-Sum Games
Sort
View
JMLR
2010
119views more  JMLR 2010»
13 years 1 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ICAC
2005
IEEE
13 years 12 months ago
Self-Optimizing Architecture for QoS Provisioning in Differentiated Services
This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...
Daniel Yagan, Chen-Khong Tham
ATAL
2010
Springer
13 years 7 months ago
Planning against fictitious players in repeated normal form games
Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design pla...
Enrique Munoz de Cote, Nicholas R. Jennings
ANSS
2001
IEEE
13 years 10 months ago
Simulation-Based Engineering of Complex Adaptive Systems Using a Classifier Block
A Complex Adaptive System (CAS) is a network of communicating, intelligent agents where each agent adapts its behavior in order to collaborate with other agents to achieve overall...
John R. Clymer, David J. Chen
UAI
2003
13 years 7 months ago
On the Convergence of Bound Optimization Algorithms
Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...
Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...