Search Sciweavers | Sciweavers

56 search results - page 11 / 12

» Reinforcement Learning for Average Reward Zero-Sum Games

136

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

14 years 8 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

101

Voted

ICAC
2005
IEEE

108views Applied Computing» more ICAC 2005»

Self-Optimizing Architecture for QoS Provisioning in Differentiated Services

15 years 7 months ago

Download csdl2.computer.org

This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...

Daniel Yagan, Chen-Khong Tham

claim paper

Read More »

123

click to vote

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Planning against fictitious players in repeated normal form games

15 years 2 months ago

Download www.aamas-conference.org

Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design pla...

Enrique Munoz de Cote, Nicholas R. Jennings

claim paper

Read More »

126

click to vote

ANSS
2001
IEEE

143views Modeling and Simulation» more ANSS 2001»

Simulation-Based Engineering of Complex Adaptive Systems Using a Classifier Block

15 years 5 months ago

Download www.eforell.com

A Complex Adaptive System (CAS) is a network of communicating, intelligent agents where each agent adapts its behavior in order to collaborate with other agents to achieve overall...

John R. Clymer, David J. Chen

claim paper

Read More »

125

Voted

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 3 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

« Prev « First page 11 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers