partially observable markov decision process

14

GLOBECOM
2007
IEEE

134views Communications» more GLOBECOM 2007»

Bursty Traffic in Energy-Constrained Opportunistic Spectrum Access

13 years 8 months ago

We design opportunistic spectrum access strategies for improving spectrum efficiency. In each slot, a secondary user chooses a subset of channels to sense and decides whether to ac...

Yunxia Chen, Qing Zhao, Ananthram Swami

claim paper

Read More »

9

click to vote

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

13 years 9 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

13

click to vote

ECML
2005
Springer

143views Machine Learning» more ECML 2005»

Active Learning in Partially Observable Markov Decision Processes

13 years 10 months ago

Download www.cs.mcgill.ca

This paper examines the problem of ﬁnding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly speciﬁed. W...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

16

click to vote

IAT
2005
IEEE

132views Intelligent Agents» more IAT 2005»

Decomposing Large-Scale POMDP Via Belief State Analysis

13 years 10 months ago

Download www.comp.hkbu.edu.hk

Partially observable Markov decision process (POMDP) is commonly used to model a stochastic environment with unobservable states for supporting optimal decision making. Computing ...

Xin Li, William K. Cheung, Jiming Liu

claim paper

Read More »

8

click to vote

ICRA
2007
IEEE

134views Robotics» more ICRA 2007»

Grasping POMDPs

13 years 11 months ago

Download people.csail.mit.edu

Abstract— We provide a method for planning under uncertainty for robotic manipulation by partitioning the conﬁguration space into a set of regions that are closed under complia...

Kaijen Hsiao, Leslie Pack Kaelbling, Tomás ...

claim paper

Read More »

13

click to vote

ICRA
2007
IEEE

154views Robotics» more ICRA 2007»

Oracular Partially Observable Markov Decision Processes: A Very Special Case

13 years 11 months ago

Download www.cs.cmu.edu

— We introduce the Oracular Partially Observable Markov Decision Process (OPOMDP), a type of POMDP in which the world produces no observations; instead there is an “oracle,” ...

Nicholas Armstrong-Crews, Manuela M. Veloso

claim paper

Read More »

12

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

13 years 11 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

15

click to vote

ICC
2007
IEEE

121views Communications» more ICC 2007»

Structure and Optimality of Myopic Sensing for Opportunistic Spectrum Access

13 years 11 months ago

Download www.ece.ucdavis.edu

We consider opportunistic spectrum access for secondary users over multiple channels whose occupancy by primary users is modeled as discrete-time Markov processes. Due to hardware...

Qing Zhao, Bhaskar Krishnamachari

claim paper

Read More »

10

click to vote

DATE
2007
IEEE

133views Hardware» more DATE 2007»

Stochastic modeling and optimization for robust power management in a partially observable system

13 years 11 months ago

Download www.date-conference.com

As the hardware and software complexity grows, it is unlikely for the power management hardware/software to have a full observation of the entire system status. In this paper, we ...

Qinru Qiu, Ying Tan, Qing Wu

claim paper

Read More »

17

click to vote

ICC
2008
IEEE

169views Communications» more ICC 2008»

Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

13 years 11 months ago

Download www.ece.ucdavis.edu

—We consider opportunistic communications over multiple channels where the state (“good” or “bad”) of each channel evolves as independent and identically distributed Mark...

Tara Javidi, Bhaskar Krishnamachari, Qing Zhao, Mi...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers