Search Sciweavers | Sciweavers

2108 search results - page 282 / 422

» Tracking in Reinforcement Learning

ML
2002
ACM

Finite-time Analysis of the Multiarmed Bandit Problem

15 years 3 months ago

Download homes.dsi.unimi.it

Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...

Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...

COLT
2010
Springer

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 1 month ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

JMLR
2010

Pinview: Implicit Feedback in Content-Based Image Retrieval

14 years 10 months ago

Download jmlr.csail.mit.edu

This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...

Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...

KDD
2005
ACM

Failure detection and localization in component based systems by online tracking

15 years 9 months ago

Download www.nec-labs.com

The increasing complexity of today’s systems makes fast and accurate failure detection essential for their use in mission-critical applications. Various monitoring methods provi...

Haifeng Chen, Guofei Jiang, Cristian Ungureanu, Ke...

NIPS
2007

People Tracking with the Laplacian Eigenmaps Latent Variable Model

15 years 5 months ago

Download books.nips.cc

Reliably recovering 3D human pose from monocular video requires models that bias the estimates towards typical human poses and motions. We construct priors for people tracking usi...

Zhengdong Lu, Miguel Á. Carreira-Perpiñ...
