Sciweavers

9841 search results - page 179 / 1969
» Distributed Value Functions
Sort
View
DEDS
2010
97views more  DEDS 2010»
15 years 2 months ago
On Regression-Based Stopping Times
We study approaches that fit a linear combination of basis functions to the continuation value function of an optimal stopping problem and then employ a greedy policy based on the...
Benjamin Van Roy
ML
2002
ACM
154views Machine Learning» more  ML 2002»
15 years 2 months ago
Technical Update: Least-Squares Temporal Difference Learning
TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...
Justin A. Boyan
JGO
2010
531views more  JGO 2010»
15 years 29 days ago
Characterizing zero-derivative points
We study smooth functions in several variables with a Lipschitz derivative. It is shown that these functions have the “envelope property”: Around zero-derivative points, and on...
Sanjo Zlobec
122
Voted
ICDCN
2009
Springer
15 years 9 months ago
Self-similar Functions and Population Protocols: A Characterization and a Comparison
Chandy et al. proposed the methodology of “self-similar algorithms” for distributed computation in dynamic environments. We further characterize the class of functions computab...
Swapnil Bhatia, Radim Bartos
DCC
2006
IEEE
16 years 2 months ago
Evaluation codes and plane valuations
Abstract. We apply tools coming from singularity theory, as Hamburger-Noether expansions, and from valuation theory, as generating sequences, to explicitly describe order functions...
C. Galindo, M. Sanchis