Search Sciweavers | Sciweavers

98 search results - page 18 / 20

» Using Rewards for Belief State Updates in Partially Observab...

click to vote

AAAI
2011

136views Intelligent Agents» more AAAI 2011»

Linear Dynamic Programs for Resource Management

12 years 5 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

14 years 6 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

click to vote

HRI
2007
ACM

133views Human Computer Interaction» more HRI 2007»

Efficient model learning for dialog management

13 years 9 months ago

Download www.eecs.ucf.edu

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...

Finale Doshi, Nicholas Roy

claim paper

Read More »

click to vote

MOBICOM
2009
ACM

174views Communications» more MOBICOM 2009»

Interference management via rate splitting and HARQ over time-varying fading channels

14 years 8 days ago

Download web.njit.edu

The coexistence of two unlicensed links is considered, where one link interferes with the transmission of the other, over a timevarying, block-fading channel. In the absence of fa...

Marco Levorato, Osvaldo Simeone, Urbashi Mitra

claim paper

Read More »

click to vote

HICSS
2003
IEEE

207views Biometrics» more HICSS 2003»

Formalizing Multi-Agent POMDP's in the context of network routing

13 years 11 months ago

Download www.hicss.hawaii.edu

This paper uses partially observable Markov decision processes (POMDP’s) as a basic framework for MultiAgent planning. We distinguish three perspectives: ﬁrst one is that of a...

Bharaneedharan Rathnasabapathy, Piotr J. Gmytrasie...

claim paper

Read More »

« Prev « First page 18 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers