Search Sciweavers | Sciweavers

771 search results - page 45 / 155

» Markov Decision Processes with Arbitrary Reward Processes

189

click to vote

SOCO
2010
Springer

148views Software Engineering» more SOCO 2010»

Using evolution strategies to solve DEC-POMDP problems

15 years 10 days ago

Download www.springerlink.com

Decentralized partially observable Markov decision process (DEC-POMDP) is an approach to model multi-robot decision making problems under uncertainty. Since it is NEXP-complete the...

Baris Eker, H. Levent Akin

claim paper

Read More »

145

click to vote

CORR
2010
Springer

101views Education» more CORR 2010»

Finite Optimal Control for Time-Bounded Reachability in CTMDPs and Continuous-Time Markov Games

15 years 5 months ago

Download react.cs.uni-sb.de

We establish the existence of optimal scheduling strategies for time-bounded reachability in continuous-time Markov decision processes, and of co-optimal strategies for continuous-...

Markus Rabe, Sven Schewe

claim paper

Read More »

173

click to vote

AAAI
2006

190views Intelligent Agents» more AAAI 2006»

Action Selection in Bayesian Reinforcement Learning

15 years 7 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

claim paper

Read More »

185

click to vote

ICASSP
2011
IEEE

177views Signal Processing» more ICASSP 2011»

Logarithmic weak regret of non-Bayesian restless multi-armed bandit

14 years 9 months ago

Download www.ece.ucdavis.edu

Abstract—We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics. At each time, a player chooses K out of N (N > K) arms to play. The state of each ar...

Haoyang Liu, Keqin Liu, Qing Zhao

claim paper

Read More »

158

click to vote

CAV
2010
Springer

190views Hardware» more CAV 2010»

Measuring and Synthesizing Systems in Probabilistic Environments

15 years 9 months ago

Download www-verimag.imag.fr

Often one has a preference order among the different systems that satisfy a given specification. Under a probabilistic assumption about the possible inputs, such a preference order...

Krishnendu Chatterjee, Thomas A. Henzinger, Barbar...

claim paper

Read More »

« Prev « First page 45 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers