Search Sciweavers | Sciweavers

63 search results - page 13 / 13

INFOCOM
2009
IEEE

Delay-Optimal Opportunistic Scheduling and Approximations: The Log Rule

This paper considers the design of opportunistic packet schedulers for users sharing a time-varying wireless channel from the performance and the robustness points of view. Firs...

Bilal Sadiq, Seung Jun Baek, Gustavo de Veciana

ICML
1996
IEEE

Learning Evaluation Functions for Large Acyclic Domains

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

JDCTA
2010

Modelling for Cruise Two-Dimensional Online Revenue Management System

To solve the cruise two-dimensional revenue management problem and develop such an automated system under an uncertain environment, a static model which is a stochastic integer progr...

Bingzhou Li
