Search Sciweavers | Sciweavers

1233 search results - page 191 / 247

» Reinforcement Learning in MirrorBot

117

Voted

ML
2002
ACM

133views Machine Learning» more ML 2002»

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 3 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

claim paper

Read More »

148

Voted

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 1 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

167

click to vote

JMLR
2010

141views more JMLR 2010»

Pinview: Implicit Feedback in Content-Based Image Retrieval

14 years 10 months ago

Download jmlr.csail.mit.edu

This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...

Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...

claim paper

Read More »

137

click to vote

ROBOCUP
2007
Springer

167views Robotics» more ROBOCUP 2007»

Cooperative/Competitive Behavior Acquisition Based on State Value Estimation of Others

15 years 10 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the curse of dimension problem when they are applied to multiagent dynamic environments. One of the typical...

Kentarou Noma, Yasutake Takahashi, Minoru Asada

claim paper

Read More »

138

click to vote

KI
2007
Springer

124views Artificial Intelligence» more KI 2007»

Making a Robot Learn to Play Soccer Using Reward and Punishment

15 years 10 months ago

Download www.ni.uos.de

In this paper, we show how reinforcement learning can be applied to real robots to achieve optimal robot behavior. As example, we enable an autonomous soccer robot to learn interce...

Heiko Müller, Martin Lauer, Roland Hafner, Sa...

claim paper

Read More »

« Prev « First page 191 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers