Search Sciweavers | Sciweavers

162 search results - page 27 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

CORR
2008
Springer

122 views | Education

Strategy Improvement for Concurrent Safety Games

15 years 3 months ago

Download www.soe.ucsc.edu

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition ...

Krishnendu Chatterjee, Luca de Alfaro, Thomas A. H...

PKDD
2010
Springer

129 views | Data Mining

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 1 month ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

VTC
2007
IEEE

91 views | Communications

Q-Learning-based Hybrid ARQ for High Speed Downlink Packet Access in UMTS

15 years 9 months ago

Download ntserver.cm.nctu.edu.tw

Abstract. In this paper, a Q-learning-based hybrid automatic repeat request (Q-HARQ) scheme is proposed to achieve efficient resource utilization for high speed downlink packet acc...

Chung-Ju Chang, Chia-Yuan Chang, Fang-Ching Ren

ATAL
2009
Springer

205 views | Intelligent Agents

Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

15 years 9 months ago

Download www.aamas-conference.org

Recent scaling up of decentralized partially observable Markov decision process (DEC-POMDP) solvers towards realistic applications is mainly due to approximate methods. Of this fa...

Jilles Steeve Dibangoye, Abdel-Illah Mouaddib, Bra...

JAIR
2006

101 views

Resource Allocation Among Agents with MDP-Induced Preferences

15 years 3 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute acti...

Dmitri A. Dolgov, Edmund H. Durfee
