Search Sciweavers | Sciweavers

146

ML
2002
ACM

100views Machine Learning» more ML 2002»

Structure in the Space of Value Functions

15 years 5 months ago

Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...

David J. Foster, Peter Dayan

claim paper

Read More »

149

click to vote

ML
2002
ACM

121views Machine Learning» more ML 2002»

Near-Optimal Reinforcement Learning in Polynomial Time

15 years 5 months ago

Download www.cis.upenn.edu

We present new algorithms for reinforcement learning, and prove that they have polynomial bounds on the resources required to achieve near-optimal return in general Markov decisio...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

175

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 5 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

151

Voted

MP
2002

84views more MP 2002»

A decomposition procedure based on approximate Newton directions

15 years 5 months ago

Download docubib.uc3m.es

The efficient solution of large-scale linear and nonlinear optimization problems may require exploiting any special structure in them in an efficient manner. We describe and analy...

Antonio J. Conejo, Francisco J. Nogales, Francisco...

claim paper

Read More »

131

click to vote

MP
2002

85views more MP 2002»

Generalized Goal Programming: polynomial methods and applications

15 years 5 months ago

Download www.optimization-online.org

In this paper we address a general Goal Programming problem with linear objectives, convex constraints, and an arbitrary componentwise nondecreasing norm to aggregate deviations w...

Emilio Carrizosa, Jörg Fliege

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers