Sciweavers

42 search results - page 8 / 9
» ecml 2005
Sort
View
ECML
2005
Springer
13 years 11 months ago
Annealed Discriminant Analysis
Abstract. Motivated by the analogies to statistical physics, the deterministic annealing (DA) method has successfully been demonstrated in a variety of application. In this paper, ...
Gang Wang, Zhihua Zhang, Frederick H. Lochovsky
ECML
2005
Springer
13 years 11 months ago
Natural Actor-Critic
This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...
Jan Peters, Sethu Vijayakumar, Stefan Schaal
ECML
2005
Springer
13 years 11 months ago
Using Advice to Transfer Knowledge Acquired in One Reinforcement Learning Task to Another
We present a method for transferring knowledge learned in one task to a related task. Our problem solvers employ reinforcement learning to acquire a model for one task. We then tra...
Lisa Torrey, Trevor Walker, Jude W. Shavlik, Richa...
ECML
2005
Springer
13 years 10 months ago
A Distance-Based Approach for Action Recommendation
Abstract. Rule induction has attracted a great deal of attention in Machine Learning and Data Mining. However, generating rules is not an end in itself because their applicability ...
Ronan Trepos, Ansaf Salleb, Marie-Odile Cordier, V...
ECML
2005
Springer
13 years 11 months ago
Nonrigid Embeddings for Dimensionality Reduction
Spectral methods for embedding graphs and immersing data manifolds in low-dimensional speaces are notoriously unstable due to insufficient and/or numberically ill-conditioned con...
Matthew Brand