Search Sciweavers | Sciweavers

876 search results - page 84 / 176

» On a theory of learning with similarity functions


JMLR
2012

200 views | Programming Languages | JMLR 2012

Contextual Bandit Learning with Predictable Rewards

13 years 2 months ago

Download www.cs.princeton.edu

Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...

Alekh Agarwal, Miroslav Dudík, Satyen Kale,...


GECCO
2006
Springer

159 views | Optimization | GECCO 2006

Standard and averaging reinforcement learning in XCS

15 years 3 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono


MS
2003

118 views | Modeling And Simulation | MS 2003

Information-theoretic Competitive Learning

15 years 1 month ago

Download www.cs.stir.ac.uk

In this paper, we propose a new supervised learning method whereby information is controlled by the associated cost in an intermediate layer, and in an output layer, errors bet...

Ryotaro Kamimura


MM
2009
ACM

269 views | Multimedia | MM 2009

Semi-supervised topic modeling for image annotation

15 years 6 months ago

Download www.shaoyuanlong.com

We propose a novel technique for semi-supervised image annotation which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic ...

Yuanlong Shao, Yuan Zhou, Xiaofei He, Deng Cai, Hu...


ATAL
2009
Springer

155 views | Intelligent Agents | ATAL 2009

Learning equilibria in repeated congestion games

15 years 6 months ago

Download www.cs.huji.ac.il

While the class of congestion games has been thoroughly studied in the multi-agent systems literature, settings with incomplete information have received relatively little attenti...

Moshe Tennenholtz, Aviv Zohar
