Search Sciweavers | Sciweavers

502 search results - page 63 / 101

» Monotone Approximation of Decision Problems

136

click to vote

STOC
1997
ACM

125views Algorithms» more STOC 1997»

An Interruptible Algorithm for Perfect Sampling via Markov Chains

15 years 9 months ago

Download www.mts.jhu.edu

For a large class of examples arising in statistical physics known as attractive spin systems (e.g., the Ising model), one seeks to sample from a probability distribution π on an...

James Allen Fill

claim paper

Read More »

155

click to vote

SIGECOM
2009
ACM

137views ECommerce» more SIGECOM 2009»

An exact almost optimal algorithm for target set selection in social networks

15 years 11 months ago

Download www.ii.uib.no

The Target Set Selection problem proposed by Kempe, Kleinberg, and Tardos, gives a nice clean combinatorial formulation for many problems arising in economy, sociology, and medicin...

Oren Ben-Zwi, Danny Hermelin, Daniel Lokshtanov, I...

claim paper

Read More »

126

Voted

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 6 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

114

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 11 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

148

click to vote

APNOMS
2006
Springer

103views Computer Networks» more APNOMS 2006»

Network-Adaptive QoS Routing Using Local Information

15 years 8 months ago

Download www.apnoms.org

In this paper, we propose the localized adaptive QoS routing scheme using POMDP(partially observable Markov Decision Processes) and Exploration Bonus. In order to deal with POMDP p...

Jeongsoo Han

claim paper

Read More »

« Prev « First page 63 / 101 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers