Search Sciweavers | Sciweavers

60 search results - page 12 / 12

» Iteratively Extending Time Horizon Reinforcement Learning

click to vote

ATAL
2006
Springer

147views Intelligent Agents» more ATAL 2006»

Efficient agents for cliff-edge environments with a large set of decision options

13 years 9 months ago

Download www.umiacs.umd.edu

This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...

Ron Katz, Sarit Kraus

claim paper

Read More »

click to vote

ICARCV
2008
IEEE

199views Robotics» more ICARCV 2008»

Error propagation suppression in Self-servo Track Writer by time-domain control design

13 years 11 months ago

Download mizugaki.iis.u-tokyo.ac.jp

—Control design of Self-servo Track Writer (SSTW) has become an important issue in Hard Disk Drive research. This paper discusses the error propagation problem in SSTW control. A...

Sehoon Oh, Yoichi Hori

claim paper

Read More »

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 6 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

click to vote

SIGIR
2009
ACM

120views Information Technology» more SIGIR 2009»

Temporal collaborative filtering with adaptive neighbourhoods

13 years 11 months ago

Download www.cs.ucl.ac.uk

Recommender Systems, based on collaborative ﬁltering (CF), aim to accurately predict user tastes, by minimising the mean error achieved on hidden test sets of user ratings, afte...

Neal Lathia, Stephen Hailes, Licia Capra

claim paper

Read More »

click to vote

DSP
2008

166views Emerging Technology» more DSP 2008»

Extension of higher-order HMC modeling with application to image segmentation

13 years 5 months ago

Download www.multitel.be

In this work, we propose to improve the neighboring relationship ability of the Hidden Markov Chain (HMC) model, by extending the memory lengthes of both the Markov chain process ...

Lamia Benyoussef, Cyril Carincotte, Stéphan...

claim paper

Read More »

« Prev « First page 12 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers