Search Sciweavers | Sciweavers

238 search results - page 31 / 48

» Value-Function Approximations for Partially Observable Marko...


ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

15 years 11 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller


ICN
2007
Springer

97views Computer Networks» more ICN 2007»

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

15 years 11 months ago

Download www.sce.carleton.ca

In this paper, we use the Markov Decision Process (MDP) technique to find the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...


ICASSP
2008
IEEE

163views Signal Processing» more ICASSP 2008»

Link throughput of multi-channel opportunistic access with limited sensing

15 years 12 months ago

Download www.ece.ucdavis.edu

We aim to characterize the maximum link throughput of a multi-channel opportunistic communication system. The states of these channels evolve as independent and identically dist...

Keqin Liu, Qing Zhao


ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Transfer via soft homomorphisms

16 years 1 hour ago

Download www.eecs.umich.edu

The field of transfer learning aims to speed up learning across multiple related tasks by transferring knowledge between source and target tasks. Past work has shown that when th...

Jonathan Sorg, Satinder Singh


ICRA
2008
IEEE

167views Robotics» more ICRA 2008»

An approximate algorithm for solving oracular POMDPs

15 years 12 months ago

Download www.cs.cmu.edu

We propose a new approximate algorithm, LAJIV (Lookahead J-MDP Information Value), to solve Oracular Partially Observable Markov Decision Problems (OPOMDPs), a special ...

Nicholas Armstrong-Crews, Manuela M. Veloso
