Search Sciweavers | Sciweavers

829 search results - page 129 / 166

» A time aggregation approach to Markov decision processes

126

click to vote

DSN
2009
IEEE

131views Computer Networks» more DSN 2009»

RRE: A game-theoretic intrusion Response and Recovery Engine

14 years 11 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...

claim paper

Read More »

102

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 2 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

15 years 8 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

131

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Towards Faster Planning with Continuous Resources in Stochastic Domains

15 years 4 months ago

Download www.aaai.org

Agents often have to construct plans that obey resource limits for continuous resources whose consumption can only be characterized by probability distributions. While Markov Deci...

Janusz Marecki, Milind Tambe

claim paper

Read More »

119

click to vote

JAIR
2006

101views more JAIR 2006»

Resource Allocation Among Agents with MDP-Induced Preferences

15 years 1 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute acti...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

« Prev « First page 129 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers