Search Sciweavers | Sciweavers

371 search results - page 58 / 75

» The Complexity of Decentralized Control of Markov Decision P...


ICRA
2008
IEEE


Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

15 years 6 months ago

Download www.cs.cmu.edu

We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...


NIPS
2003


A Nonlinear Predictive State Representation

15 years 1 month ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh


EOR
2006


Performance prediction of an unmanned airborne vehicle multi-agent system

14 years 11 months ago

Download www.damas.ift.ulaval.ca

Consider unmanned airborne vehicle (UAV) control agents in a dynamic multi-agent system. The agents must have a set of goals such as destination airport and intermediate positions...

Zhaotong Lian, Abhijit Deshmukh


GECCO
2009
Springer


Uncertainty handling CMA-ES for reinforcement learning

14 years 9 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...

Verena Heidrich-Meisner, Christian Igel


CORR
2010
Springer


Structural Solutions to Dynamic Scheduling for Multimedia Transmission in Unknown Wireless Environments

14 years 10 months ago

Download medianetlab.ee.ucla.edu

In this paper, we propose a systematic solution to the problem of scheduling delay-sensitive media data for transmission over time-varying wireless channels. We first formulate th...

Fangwen Fu, Mihaela van der Schaar
