Search Sciweavers | Sciweavers

1277 search results - page 180 / 256

» Terminating Decision Algorithms Optimally

NIPS
2007

146 views, Information Technology

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 5 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

MMNS
2004

106 views, Multimedia

Content-Based Adaptation of Streamed Multimedia

15 years 5 months ago

Download pel.ucd.ie

Most adaptive delivery mechanisms for streaming multimedia content do not explicitly consider user-perceived quality when making adaptation decisions. We show that an optimal adap...

Nicola Cranley, Liam Murphy, Philip Perry

NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

ASAP
2010
IEEE

185 views, Hardware

ImpEDE: A multidimensional design-space exploration framework for biomedical-implant processors

15 years 4 months ago

Download ce.et.tudelft.nl

The demand for biomedical implants keeps increasing. However, most of the current implant design methodologies involve custom-ASIC design. The SiMS project aims to chang...

Dhara Dave, Christos Strydis, Georgi Gaydadjiev

CDC
2010
IEEE

139 views, Control Systems

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 10 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu
