Sciweavers

2005 search results - page 348 / 401
» Decisive Markov Chains
GLOBECOM
2007
IEEE
15 years 6 months ago
Cross-Layer Call Admission Control for a CDMA Uplink Employing a Base-Station Antenna Array
A novel cross-layer call admission control policy is proposed for a general CDMA beamforming system. In contrast to previously proposed call admission control (CAC) policies wh...
Wei Sheng, Steven D. Blostein
GLOBECOM
2007
IEEE
15 years 6 months ago
Constrained Stochastic Games in Wireless Networks
We consider the situation where N nodes share a common access point. With each node i there is an associated buffer and channel state that change in time. Node i dynamically cho...
Eitan Altman, Konstantin Avrachenkov, Nicolas Bon...
ICC
2007
IEEE
15 years 6 months ago
Dynamic Lightpath Establishment for Service Differentiation Based on Optimal MDP Policy in All-Optical Networks with Wavelength
In this paper, we propose a dynamic lightpath establishment method for service differentiation in all-optical WDM networks with the capability of full-range wavelength conversi...
Takuji Tachibana, Shoji Kasahara, Kenji Sugimoto
ATAL
2007
Springer
15 years 6 months ago
Combinatorial resource scheduling for multiagent MDPs
Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...
Dmitri A. Dolgov, Michael R. James, Michael E. Sam...
ECML
2007
Springer
15 years 6 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber