Sciweavers

» Ensemble Algorithms in Reinforcement Learning
ICDM
2003
IEEE
15 years 9 months ago
Cost-Sensitive Learning by Cost-Proportionate Example Weighting
We propose and evaluate a family of methods for converting classifier learning algorithms and classification theory into cost-sensitive algorithms and theory. The proposed conve...
Bianca Zadrozny, John Langford, Naoki Abe
GECCO
2006
Springer
15 years 8 months ago
Reward allotment in an event-driven hybrid learning classifier system for online soccer games
This paper describes our study into the concept of using rewards in a classifier system applied to the acquisition of decision-making algorithms for agents in a soccer game. Our a...
Yuji Sato, Yosuke Akatsuka, Takenori Nishizono
ATAL
2004
Springer
15 years 9 months ago
When to Apply the Fifth Commandment: The Effects of Parenting on Genetic and Learning Agents
This paper explores hybrid agents that use a variety of techniques to improve their performance in an environment over time. We considered, specifically, genetic-learning-parentin...
Michael Berger, Jeffrey S. Rosenschein
EMNLP
2007
15 years 5 months ago
Single Malt or Blended? A Study in Multilingual Parser Optimization
We describe a two-stage optimization of the MaltParser system for the ten languages in the multilingual track of the CoNLL 2007 shared task on dependency parsing. The first stage...
Johan Hall, Jens Nilsson, Joakim Nivre, Gülse...
NIPS
2008
15 years 5 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake