Search Sciweavers | Sciweavers

30 search results - page 6 / 6

» Model-Based Average Reward Reinforcement Learning

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

13 years 11 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

JMLR
2010

119views more JMLR 2010»

A Convergent Online Single Time Scale Actor Critic Algorithm

12 years 11 months ago

Download jmlr.csail.mit.edu

Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...

Dotan Di Castro, Ron Meir

claim paper

Read More »

click to vote

ICAC
2005
IEEE

108views Applied Computing» more ICAC 2005»

Self-Optimizing Architecture for QoS Provisioning in Differentiated Services

13 years 10 months ago

Download csdl2.computer.org

This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...

Daniel Yagan, Chen-Khong Tham

claim paper

Read More »

click to vote

ATAL
2003
Springer

172views Intelligent Agents» more ATAL 2003»

Resource allocation games with changing resource capacities

13 years 10 months ago

Download www.isi.edu

In this paper we study a class of resource allocation games which are inspired by the El Farol Bar problem. We consider a system of competitive agents that have to choose between ...

Aram Galstyan, Shashikiran Kolar, Kristina Lerman

claim paper

Read More »

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

13 years 6 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

« Prev « First page 6 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers