Search Sciweavers | Sciweavers

87 search results - page 17 / 18

» Dynamic Programming for Partially Observable Stochastic Game...

ATAL
2009
Springer

205 views, Intelligent Agents

Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

13 years 12 months ago

Download www.aamas-conference.org

Recent scaling up of decentralized partially observable Markov decision process (DEC-POMDP) solvers towards realistic applications is mainly due to approximate methods. Of this fa...

Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Bra...

CDC
2008
IEEE

140 views, Control Systems

Information state for Markov decision processes with network delays

13 years 12 months ago

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

CDC
2009
IEEE

124 views, Control Systems

Inverse modeling for open boundary conditions in channel network

13 years 6 months ago

Download www.ce.berkeley.edu

An inverse modeling problem for systems governed by first-order, hyperbolic partial differential equations subject to periodic forcing is investigated. The problem is de...

Qingfang Wu, Mohammad Rafiee, Andrew Tinka, Alexan...

IPPS
2010
IEEE

209 views, Distributed And Parallel Com...

Improving numerical reproducibility and stability in large-scale numerical simulations on GPUs

13 years 3 months ago

Download gcl.cis.udel.edu

The advent of general purpose graphics processing units (GPGPUs) brings about a whole new platform for running numerically intensive applications at high speeds. Their multi-...

Michela Taufer, Omar Padron, Philip Saponaro, Sand...

ML
1998
ACM

101 views, Machine Learning

Elevator Group Control Using Multiple Reinforcement Learning Agents

13 years 5 months ago

Download www.clear.rice.edu

Recent algorithmic and theoretical advances in reinforcement learning (RL) have attracted widespread interest. RL algorithms have appeared that approximate dynamic programming on an ...

Robert H. Crites, Andrew G. Barto
