Sciweavers

1955 search results - page 173 / 391
» Online Multitask Learning
Sort
View
ECML
2005
Springer
15 years 9 months ago
Towards Finite-Sample Convergence of Direct Reinforcement Learning
Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...
Shiau Hong Lim, Gerald DeJong
119
Voted
FORMATS
2004
Springer
15 years 8 months ago
Learning of Event-Recording Automata
Abstract. We extend Angluin’s algorithm for on-line learning of regular languages to the setting of timed systems. We consider systems that can be described by a class of determi...
Olga Grinchtein, Bengt Jonsson, Martin Leucker
126
Voted
KES
2004
Springer
15 years 8 months ago
Coordination in Multiagent Reinforcement Learning Systems
This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...
M. A. S. Kamal, Junichi Murata
110
Voted
SOCRATES
2008
105views Education» more  SOCRATES 2008»
15 years 4 months ago
Laboratory Door Opens to Non-formal Learning Communities. Science Centres as Mediators
The e-KNOWNET is a Lifelong Learning project, which aims to develop an innovative and viable mechanism to facilitate the flow of new scientific knowledge produced in the research ...
G. Anyfandi, V. Laopodis, V. Koulaidis, Nicolas Ap...
NIPS
2003
15 years 4 months ago
Learning Curves for Stochastic Gradient Descent in Linear Feedforward Networks
Gradient-following learning methods can encounter problems of implementation in many applications, and stochastic variants are frequently used to overcome these difficulties. We ...
Justin Werfel, Xiaohui Xie, H. Sebastian Seung