Search Sciweavers | Sciweavers

120 search results - page 23 / 24

» Hierarchical Solution of Markov Decision Processes using Mac...

click to vote

CPAIOR
2009
Springer

95views Operations Research» more CPAIOR 2009»

Optimal Interdiction of Unreactive Markovian Evaders

14 years 10 days ago

Download math.lanl.gov

The interdiction problem arises in a variety of areas including military logistics, infectious disease control, and counter-terrorism. In the typical formulation of network interdi...

Alexander Gutfraind, Aric A. Hagberg, Feng Pan

claim paper

Read More »

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

13 years 12 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

click to vote

ATAL
2003
Springer

185views Intelligent Agents» more ATAL 2003»

Optimizing information exchange in cooperative multi-agent systems

13 years 11 months ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein

claim paper

Read More »

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

13 years 7 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

click to vote

CORR
2008
Springer

173views Education» more CORR 2008»

Decomposition Principles and Online Learning in Cross-Layer Optimization for Delay-Sensitive Applications

13 years 5 months ago

Download documents.scribd.com

In this paper, we propose a general cross-layer optimization framework in which we explicitly consider both the heterogeneous and dynamically changing characteristics of delay-sens...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

« Prev « First page 23 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers