Search Sciweavers | Sciweavers

813 search results - page 104 / 163

» Ensemble Algorithms in Reinforcement Learning

153

click to vote

ICDM
2003
IEEE

134views Data Mining» more ICDM 2003»

Cost-Sensitive Learning by Cost-Proportionate Example Weighting

15 years 9 months ago

Download hunch.net

We propose and evaluate a family of methods for converting classiﬁer learning algorithms and classiﬁcation theory into cost-sensitive algorithms and theory. The proposed conve...

Bianca Zadrozny, John Langford, Naoki Abe

claim paper

Read More »

164

click to vote

GECCO
2006
Springer

198views Optimization» more GECCO 2006»

Reward allotment in an event-driven hybrid learning classifier system for online soccer games

15 years 8 months ago

Download www.cs.bham.ac.uk

This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...

Yuji Sato, Yosuke Akatsuka, Takenori Nishizono

claim paper

Read More »

100

click to vote

ATAL
2004
Springer

221views Intelligent Agents» more ATAL 2004»

When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents

15 years 9 months ago

Download leibniz.cs.huji.ac.il

This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, speciﬁcally, geneticlearning-parentin...

Michael Berger, Jeffrey S. Rosenschein

claim paper

Read More »

173

click to vote

EMNLP
2007

122views Natural Language Processing» more EMNLP 2007»

Single Malt or Blended? A Study in Multilingual Parser Optimization

15 years 5 months ago

Download acl.ldc.upenn.edu

We describe a two-stage optimization of the MaltParser system for the ten languages in the multilingual track of the CoNLL 2007 shared task on dependency parsing. The ﬁrst stage...

Johan Hall, Jens Nilsson, Joakim Nivre, Gülse...

claim paper

Read More »

151

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 5 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

« Prev « First page 104 / 163 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers