Search Sciweavers | Sciweavers

120 search results - page 24 / 24

» Hierarchical Solution of Markov Decision Processes using Mac...

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 6 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

HICSS
2003
IEEE

207views Biometrics» more HICSS 2003»

Formalizing Multi-Agent POMDP's in the context of network routing

13 years 10 months ago

Download www.hicss.hawaii.edu

This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for MultiAgent planning. We distinguish three perspectives: first one is that of a...

Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...

IPCCC
2007
IEEE

126views Communications» more IPCCC 2007»

Optimising Networks Against Malware

13 years 11 months ago

Download www.professeurs.polymtl.ca

Rapidly-spreading malicious software is an important threat on today’s computer networks. Most solutions that have been proposed to counter this threat are based on our ability ...

Pierre-Marc Bureau, José M. Fernandez

ICDCS
2010
IEEE

167views Distributed And Parallel Com...» more ICDCS 2010»

Stochastic Steepest-Descent Optimization of Multiple-Objective Mobile Sensor Coverage

13 years 9 months ago

Download www.cs.purdue.edu

We propose a steepest descent method to compute optimal control parameters for balancing between multiple performance objectives in stateless stochastic scheduling, wherein the ...

Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip, Na...

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 6 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore
