Sciweavers

60 search results - page 12 / 12
» Iteratively Extending Time Horizon Reinforcement Learning
ATAL
2006
Springer
13 years 9 months ago
Efficient agents for cliff-edge environments with a large set of decision options
This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...
Ron Katz, Sarit Kraus
ICARCV
2008
IEEE
13 years 11 months ago
Error propagation suppression in Self-servo Track Writer by time-domain control design
Control design of Self-servo Track Writer (SSTW) has become an important issue in Hard Disk Drive research. This paper discusses the error propagation problem in SSTW control. A...
Sehoon Oh, Yoichi Hori
NIPS
1998
13 years 6 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
SIGIR
2009
ACM
13 years 11 months ago
Temporal collaborative filtering with adaptive neighbourhoods
Recommender Systems, based on collaborative filtering (CF), aim to accurately predict user tastes, by minimising the mean error achieved on hidden test sets of user ratings, afte...
Neal Lathia, Stephen Hailes, Licia Capra
DSP
2008
13 years 5 months ago
Extension of higher-order HMC modeling with application to image segmentation
In this work, we propose to improve the neighboring relationship ability of the Hidden Markov Chain (HMC) model by extending the memory lengths of both the Markov chain process ...
Lamia Benyoussef, Cyril Carincotte, Stéphan...