Search Sciweavers | Sciweavers

288 search results - page 30 / 58

» Risk-averse dynamic programming for Markov decision processe...

115

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

14 years 11 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

125

click to vote

GLOBECOM
2010
IEEE

189views Communications» more GLOBECOM 2010»

Need-Based Communication for Smart Grid: When to Inquire Power Price?

14 years 11 months ago

Download iweb.tntech.edu

In smart grid, a home appliance can adjust its power consumption level according to the realtime power price obtained from communication channels. Most studies on smart grid do not...

Husheng Li, Robert C. Qiu

claim paper

Read More »

120

click to vote

CDC
2010
IEEE

136views Control Systems» more CDC 2010»

Pathologies of temporal difference methods in approximate dynamic programming

14 years 8 months ago

Download web.mit.edu

Approximate policy iteration methods based on temporal differences are popular in practice, and have been tested extensively, dating to the early nineties, but the associated conve...

Dimitri P. Bertsekas

claim paper

Read More »

121

click to vote

CORR
2010
Springer

103views Education» more CORR 2010»

Structural Solutions to Dynamic Scheduling for Multimedia Transmission in Unknown Wireless Environments

14 years 12 months ago

Download medianetlab.ee.ucla.edu

In this paper, we propose a systematic solution to the problem of scheduling delay-sensitive media data for transmission over time-varying wireless channels. We first formulate th...

Fangwen Fu, Mihaela van der Schaar

claim paper

Read More »

105

click to vote

CORR
2007
Springer

94views Education» more CORR 2007»

Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm

15 years 1 months ago

Download www.ieee-infocom.org

— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...

Bruce Hajek, Kevin Mitzel, Sichao Yang

claim paper

Read More »

« Prev « First page 30 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers