Sciweavers

502 search results - page 52 / 101
» Monotone Approximation of Decision Problems
ICML
2001
IEEE
15 years 10 months ago
Off-Policy Temporal Difference Learning with Function Approximation
We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta
CDC
2008
IEEE
15 years 4 months ago
Multistage investments with recourse: A single-asset case with transaction costs
We consider a financial decision problem involving dynamic investment decisions on a single risky instrument over multiple and discrete time periods. Investment returns are as...
Ufuk Topcu, Giuseppe Carlo Calafiore, Laurent El G...
ICCSA
2007
Springer
15 years 3 months ago
On Optimization of the Importance Weighted OWA Aggregation of Multiple Criteria
The problem of aggregating multiple numerical criteria to form overall objective functions is of considerable importance in many disciplines. The ordered weighted averaging (OWA) a...
Wlodzimierz Ogryczak, Tomasz Sliwinski
TCS
2008
14 years 9 months ago
Distance-k knowledge in self-stabilizing algorithms
Many graph problems seem to require knowledge that extends beyond the immediate neighbors of a node. The usual self-stabilizing model only allows for nodes to make decisi...
Wayne Goddard, Stephen T. Hedetniemi, David Pokras...
VMCAI
2010
Springer
15 years 7 months ago
Best Probabilistic Transformers
This paper investigates relative precision and optimality of analyses for concurrent probabilistic systems. Aiming at the problem at the heart of probabilistic model checking - com...
Björn Wachter, Lijun Zhang