Sciweavers

» Learning to Take Actions
COLT
2008
Springer
14 years 12 months ago
Regret Bounds for Sleeping Experts and Bandits
We study on-line decision problems where the set of actions that are available to the decision algorithm varies over time. With a few notable exceptions, such problems remained larg...
Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...
WSPI
2008
14 years 11 months ago
Practices, Systems, and Context Working as Core Concepts in Modeling Socio-Technical Systems
This work draws on the cultural historical activity-theory and the theory of social systems to model socio-technical systems. The concepts of practice, system, and context work as ...
Heidrun Allert, Christoph Richter
CORR
2010
Springer
14 years 10 months ago
MDPs with Unawareness
Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...
Joseph Y. Halpern, Nan Rong, Ashutosh Saxena
AAAI
2011
13 years 10 months ago
Differential Eligibility Vectors for Advantage Updating and Gradient Methods
In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...
Francisco S. Melo

Publication
15 years 5 months ago
Efficient methods for near-optimal sequential decision making under uncertainty
This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal se...
Christos Dimitrakakis