Search Sciweavers | Sciweavers

371 search results - page 62 / 75

» The Complexity of Decentralized Control of Markov Decision P...

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Combinatorial resource scheduling for multiagent MDPs

15 years 6 months ago

Download ai.stanford.edu

Optimal resource scheduling in multiagent systems is a computationally challenging task, particularly when the values of resources are not additive. We consider the combinatorial ...

Dmitri A. Dolgov, Michael R. James, Michael E. Sam...

claim paper

Read More »

click to vote

EENERGY
2010

150views Computer Networks» more EENERGY 2010»

Optimal sleep patterns for serving delay-tolerant jobs

15 years 3 months ago

Download www.princeton.edu

Sleeping is an important method to reduce energy consumption in many information and communication systems. In this paper we focus on a typical server under dynamic load, where en...

Ioannis Kamitsos, Lachlan L. H. Andrew, Hongseok K...

claim paper

Read More »

118

click to vote

CN
2002

127views more CN 2002»

Optimal policy for label switched path setup in MPLS networks

14 years 11 months ago

Download www.ece.iit.edu

An important aspect in designing a multiprotocol label switching (MPLS) network is to determine an initial topology and to adapt it to the traffic load. A topology change in an MP...

Tricha Anjali, Caterina M. Scoglio, Jaudelice Cava...

claim paper

Read More »

Voted

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 17 days ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

15 years 6 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

« Prev « First page 62 / 75 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers