Search Sciweavers | Sciweavers

656 search results - page 102 / 132

» Complexity of finite-horizon Markov decision process problem...

144

Voted

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

16 years 2 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

123

click to vote

GLOBECOM
2006
IEEE

160views Communications» more GLOBECOM 2006»

Adaptive Learning of Transmission Control Policies for MIMO Fading Channels under Delay Constraint

15 years 8 months ago

Download www.ece.ubc.ca

— This paper addresses learning based adaptive resource allocation for wireless MIMO channels with Markovian fading. The problem is posed as Constrained Markov Decision Process w...

Dejan V. Djonin, Vikram Krishnamurthy

claim paper

Read More »

118

click to vote

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

Symbolic Dynamic Programming for First-order POMDPs

15 years 3 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting

claim paper

Read More »

112

Voted

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 3 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

125

click to vote

GLOBECOM
2010
IEEE

151views Communications» more GLOBECOM 2010»

Cooperation Stimulation in Cognitive Networks Using Indirect Reciprocity Game Modelling

14 years 12 months ago

Download www.cspl.umd.edu

In cognitive networks, since nodes generally belong to different authorities and pursue different goals, they will not cooperate with others unless cooperation can improve their ow...

Yan Chen, K. J. Ray Liu

claim paper

Read More »

« Prev « First page 102 / 132 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers