Search Sciweavers | Sciweavers

771 search results - page 132 / 155

» Markov Decision Processes with Arbitrary Reward Processes

117

Voted

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Combinatorial resource scheduling for multiagent MDPs

15 years 8 months ago

Download ai.stanford.edu

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

claim paper

Read More »

110

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 8 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

142

click to vote

ROBOCUP
2007
Springer

99views Robotics» more ROBOCUP 2007»

Instance-Based Action Models for Fast Action Planning

15 years 8 months ago

Download userweb.cs.utexas.edu

Abstract. Two main challenges of robot action planning in real domains are uncertain action eﬀects and dynamic environments. In this paper, an instance-based action model is lear...

Mazda Ahmadi, Peter Stone

claim paper

Read More »

123

click to vote

GLOBECOM
2006
IEEE

160views Communications» more GLOBECOM 2006»

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 7 months ago

Download www.ece.ubc.ca

— This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

claim paper

Read More »

141

click to vote

QEST
2006
IEEE

143views Modeling and Simulation» more QEST 2006»

Compositional Performability Evaluation for STATEMATE

15 years 7 months ago

Download ftp.inrialpes.fr

Abstract— This paper reports on our efforts to link an industrial state-of-the-art modelling tool to academic state-of-the-art analysis algorithms. In a nutshell, we enable timed...

Eckard Böde, Marc Herbstritt, Holger Hermanns...

claim paper

Read More »

« Prev « First page 132 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers