Sciweavers

682 search results - page 116 / 137
» One-Counter Markov Decision Processes
Sort
View
94
Voted
GLOBECOM
2007
IEEE
15 years 7 months ago
Cross-Layer Call Admission Control for a CDMA Uplink Employing a Base-Station Antenna Array
— A novel cross-layer call admission control policy is proposed for a general CDMA beamforming system. In contrast to previously proposed call admission control (CAC) policies wh...
Wei Sheng, Steven D. Blostein
102
Voted
GLOBECOM
2007
IEEE
15 years 7 months ago
Constrained Stochastic Games in Wireless Networks
—We consider the situation where N nodes share a common access point. With each node i there is an associated buffer and channel state that change in time. Node i dynamically cho...
Eitan Altaian, Konstantin Avrachenkov, Nicolas Bon...
100
Voted
ATAL
2007
Springer
15 years 7 months ago
Combinatorial resource scheduling for multiagent MDPs
Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...
Dmitri A. Dolgov, Michael R. James, Michael E. Sam...
102
Voted
ECML
2007
Springer
15 years 6 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
128
Voted
ROBOCUP
2007
Springer
99views Robotics» more  ROBOCUP 2007»
15 years 6 months ago
Instance-Based Action Models for Fast Action Planning
Abstract. Two main challenges of robot action planning in real domains are uncertain action effects and dynamic environments. In this paper, an instance-based action model is lear...
Mazda Ahmadi, Peter Stone