Search Sciweavers | Sciweavers

2011 search results - page 268 / 403

» Universal Reinforcement Learning

134

click to vote

AI
2002
Springer

117views Artificial Intelligence» more AI 2002»

Programming backgammon using self-teaching neural nets

15 years 4 months ago

Download www.math-info.univ-paris5.fr

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...

Gerald Tesauro

claim paper

Read More »

127

click to vote

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 4 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

155

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 3 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

180

click to vote

JMLR
2010

141views more JMLR 2010»

Pinview: Implicit Feedback in Content-Based Image Retrieval

14 years 11 months ago

Download jmlr.csail.mit.edu

This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...

Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...

claim paper

Read More »

152

click to vote

CHI
2009
ACM

131views Human Computer Interaction» more CHI 2009»

Enhancing input device evaluation: longitudinal approaches

16 years 5 months ago

Download hci.uni-konstanz.de

Jens Gerken HCI Group, University of Konstanz Box D-73 78457 Konstanz, Germany jens.gerken@uni-konstanz.de Hans-Joachim Bieg HCI Group, University of Konstanz Box D-73 78457 Konsta...

Jens Gerken, Hans-Joachim Bieg, Stefan Dierdorf, H...

claim paper

Read More »

« Prev « First page 268 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers