Search Sciweavers | Sciweavers

508 search results - page 61 / 102

» Learning for stochastic dynamic programming


ICML
1996
IEEE

162 views, Machine Learning

Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning

16 years 8 months ago

Download reference.kfupm.edu.sa

Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the average-reward frame...

Sridhar Mahadevan


ECAL
2005
Springer

119 views, Artificial Intelligence

The Quantitative Law of Effect is a Robust Emergent Property of an Evolutionary Algorithm for Reinforcement Learning

16 years 20 days ago

Download www.psychology.emory.edu

An evolutionary reinforcement-learning algorithm, the operation of which was not associated with an optimality condition, was instantiated in an artificial organism. The algorithm ...

J. J. McDowell, Zahra Ansari


AAAI
1994

112 views, Intelligent Agents

Cost-Effective Sensing during Plan Execution

15 years 8 months ago

Download www.cs.arizona.edu

Between sensing the world after every action (as in a reactive plan) and not sensing at all (as in an open-loop plan) lies a continuum of strategies for sensing during plan execut...

Eric A. Hansen


ICMCS
2007
IEEE

149 views, Multimedia

Joint Source Coding and Data Rate Adaptation for Multi-User Wireless Video Transmission

16 years 1 month ago

Download www.eecs.northwestern.edu

Much attention has been paid to the problem of optimally utilizing resources such as spectrum, power and time in order to achieve the best video delivery quality in wireless commu...

Fan Zhai, Zhu Li, Aggelos K. Katsaggelos


ICN
2007
Springer

97 views, Computer Networks

Heuristic Approach of Optimal Code Allocation in High Speed Downlink Packet Access Networks

16 years 1 month ago

Download www.sce.carleton.ca

In this paper, we use the Markov Decision Process (MDP) technique to find the optimal code allocation policy in High-Speed Downlink Packet Access (HSDPA) networks. A discrete ...

Hussein Al-Zubaidy, Jerome Talim, Ioannis Lambadar...
