Search Sciweavers | Sciweavers

55 search results - page 4 / 11

» Reinforcement Learning with Classifier Selection for Focused...


EUROCAST
2007
Springer


A k-NN Based Perception Scheme for Reinforcement Learning

15 years 6 months ago

Download www.dia.fi.upm.es

Reinforcement Learning (RL) is a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...

José Antonio Martin H., Javier de Lope Asia...


IWCLS
2007
Springer


On Lookahead and Latent Learning in Simple LCS

15 years 6 months ago

Download www.psychologie.uni-wuerzburg.de

Learning Classifier Systems use evolutionary algorithms to facilitate rule discovery, where rule fitness is traditionally payoff-based and assigned under a sharing scheme. Most c...

Larry Bull


JMLR
2006


Collaborative Multiagent Reinforcement Learning by Payoff Propagation

15 years 8 days ago

Download jmlr.csail.mit.edu

In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...

Jelle R. Kok, Nikos A. Vlassis


CEAS
2007
Springer


Learning Fast Classifiers for Image Spam

15 years 4 months ago

Download www.seas.upenn.edu

Recently, spammers have proliferated "image spam", emails which contain the text of the spam message in a human readable image instead of the message body, making detect...

Mark Dredze, Reuven Gevaryahu, Ari Elias-Bachrach


ICML
2009
ACM


Regularization and feature selection in least-squares temporal difference learning

16 years 1 month ago

Download ai.stanford.edu

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (L...

J. Zico Kolter, Andrew Y. Ng
