Sciweavers

» Regularized Policy Iteration
TIT
2008
15 years 12 days ago
Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel
We consider the problem of several users transmitting packets to a base station, and study an optimal scheduling formulation involving three communication layers, namely, the mediu...
Munish Goyal, Anurag Kumar, Vinod Sharma
QUESTA
2000
15 years 8 days ago
On the value function of a priority queue with an application to a controlled polling model
We give a closed-form expression for the discounted weighted queue length and switching costs of a two-class single-server queueing model under a preemptive priority rule. These e...
Ger Koole, Philippe Nain
JMLR
2010
14 years 7 months ago
Finite-sample Analysis of Bellman Residual Minimization
We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is avai...
Odalric-Ambrym Maillard, Rémi Munos, Alessa...
CIA
2007
Springer
15 years 6 months ago
Multi-agent Learning Dynamics: A Survey
In this paper we compare state-of-the-art multi-agent reinforcement learning algorithms in a wide variety of games. We consider two types of algorithms: value iteration a...
H. Jaap van den Herik, Daniel Hennes, Michael Kais...
ICTAI
2006
IEEE
15 years 6 months ago
A New Hybrid GA-MDP Algorithm For The Frequency Assignment Problem
We propose a novel algorithm called GA-MDP for solving the frequency assignment problem. GA-MDP inherits the spirit of genetic algorithms with an adaptation of Markov Decision Proc...
Lhassane Idoumghar, René Schott