Sciweavers

1641 search results - page 51 / 329
» Termination Analysis with Algorithmic Learning
Sort
View
95
Voted
ICML
2008
IEEE
16 years 4 months ago
An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning
We show that linear value-function approximation is equivalent to a form of linear model approximation. We then derive a relationship between the model-approximation error and the...
Ronald Parr, Lihong Li, Gavin Taylor, Christopher ...
125
Voted
ICA
2004
Springer
15 years 9 months ago
Post-nonlinear Independent Component Analysis by Variational Bayesian Learning
Post-nonlinear (PNL) independent component analysis (ICA) is a generalisation of ICA where the observations are assumed to have been generated from independent sources by linear mi...
Alexander Ilin, Antti Honkela
186
Voted
AAMAS
2006
Springer
15 years 3 months ago
An Evolutionary Dynamical Analysis of Multi-Agent Learning in Iterated Games
In this paper, we investigate Reinforcement learning (RL) in multi-agent systems (MAS) from an evolutionary dynamical perspective. Typical for a MAS is that the environment is not ...
Karl Tuyls, Pieter Jan't Hoen, Bram Vanschoenwinke...
124
Voted
GECCO
2007
Springer
167views Optimization» more  GECCO 2007»
15 years 9 months ago
Genetically designed heuristics for the bin packing problem
The bin packing problem (BPP) is a real-world problem that arises in different industrial applications related to minimization of space or time. The aim of this research is to au...
Oana Muntean
198
Voted
ICML
2007
IEEE
16 years 4 months ago
Discriminant analysis in correlation similarity measure space
Correlation is one of the most widely used similarity measures in machine learning like Euclidean and Mahalanobis distances. However, compared with proposed numerous discriminant ...
Yong Ma, Shihong Lao, Erina Takikawa, Masato Kawad...