Search Sciweavers | Sciweavers


» Regularized Policy Iteration


TIT
2008


Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel

15 years 3 months ago

Download ece.iisc.ernet.in

We consider the problem of several users transmitting packets to a base station, and study an optimal scheduling formulation involving three communication layers, namely, the mediu...

Munish Goyal, Anurag Kumar, Vinod Sharma


QUESTA
2000


On the value function of a priority queue with an application to a controlled polling model

15 years 3 months ago

Download www.math.vu.nl

We give a closed-form expression for the discounted weighted queue length and switching costs of a two-class single-server queueing model under a preemptive priority rule. These e...

Ger Koole, Philippe Nain


JMLR
2010


Finite-sample Analysis of Bellman Residual Minimization

14 years 10 months ago

Download jmlr.csail.mit.edu

We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is avai...

Odalric-Ambrym Maillard, Rémi Munos, Alessa...


CIA
2007
Springer

Intelligent Agents

Multi-agent Learning Dynamics: A Survey

15 years 10 months ago

Download michaelkaisers.com

In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...

H. Jaap van den Herik, Daniel Hennes, Michael Kais...


ICTAI
2006
IEEE

Artificial Intelligence

A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem

15 years 10 months ago

Download www.loria.fr

We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...

Lhassane Idoumghar, René Schott
