Search Sciweavers | Sciweavers

190 search results - page 35 / 38

» An Incremental Sampling-based Algorithm for Stochastic Optim...


ICML
1996
IEEE

162 views, Machine Learning

Learning Evaluation Functions for Large Acyclic Domains

16 years 7 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore



INFOCOM
2009
IEEE

184 views, Communications

Keep Cache Replacement Simple in Peer-Assisted VoD Systems

16 years 1 months ago

Download www.eecg.toronto.edu

Peer-assisted Video-on-Demand (VoD) systems have not only received substantial recent research attention, but also been implemented and deployed with success in large-scale real...

Jiahua Wu, Baochun Li



UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 8 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...



NIPS
1998

137 views, Information Technology

Risk Sensitive Reinforcement Learning

15 years 8 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are states that are undesirable or dangerous to enter. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch



MOBICOM
2003
ACM

140 views, Communications

Minimum energy disjoint path routing in wireless ad-hoc networks

16 years 2 days ago

Download web.mit.edu

We develop algorithms for finding minimum energy disjoint paths in an all-wireless network, for both the node and link-disjoint cases. Our major results include a novel polynomial...

Anand Srinivas, Eytan Modiano

