Search Sciweavers | Sciweavers

332 search results - page 42 / 67

» Ranking policies in discrete Markov decision processes

129

Voted

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

16 years 2 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

134

Voted

CODES
2009
IEEE

178views Software Engineering» more CODES 2009»

An MDP-based application oriented optimal policy for wireless sensor networks

15 years 5 months ago

Download www.ann.ece.ufl.edu

Technological advancements due to Moore’s law have led to the proliferation of complex wireless sensor network (WSN) domains. One commonality across all WSN domains is the need ...

Arslan Munir, Ann Gordon-Ross

claim paper

Read More »

126

Voted

INFOCOM
2012
IEEE

189views Communications» more INFOCOM 2012»

Approximately optimal adaptive learning in opportunistic spectrum access

13 years 4 months ago

Download web.eecs.umich.edu

—In this paper we develop an adaptive learning algorithm which is approximately optimal for an opportunistic spectrum access (OSA) problem with polynomial complexity. In this OSA...

Cem Tekin, Mingyan Liu

claim paper

Read More »

124

click to vote

ATAL
2005
Springer

146views Intelligent Agents» more ATAL 2005»

Exploiting belief bounds: practical POMDPs for personal assistant agents

15 years 7 months ago

Download teamcore.usc.edu

Agents or agent teams deployed to assist humans often face the challenges of monitoring the state of key processes in their environment (including the state of their human users t...

Pradeep Varakantham, Rajiv T. Maheswaran, Milind T...

claim paper

Read More »

109

click to vote

AAAI
2006

134views Intelligent Agents» more AAAI 2006»

Point-based Dynamic Programming for DEC-POMDPs

15 years 3 months ago

Download hal.archives-ouvertes.fr

We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...

Daniel Szer, François Charpillet

claim paper

Read More »

« Prev « First page 42 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers