Search Sciweavers | Sciweavers

1955 search results - page 173 / 391

» Online Multitask Learning

click to vote

ECML
2005
Springer

95views Machine Learning» more ECML 2005»

Towards Finite-Sample Convergence of Direct Reinforcement Learning

15 years 9 months ago

Download www.cs.uiuc.edu

Abstract. While direct, model-free reinforcement learning often performs better than model-based approaches in practice, only the latter have yet supported theoretical guarantees f...

Shiau Hong Lim, Gerald DeJong

claim paper

Read More »

119

Voted

FORMATS
2004
Springer

116views Formal Methods» more FORMATS 2004»

Learning of Event-Recording Automata

15 years 8 months ago

Download www4.in.tum.de

Abstract. We extend Angluin’s algorithm for on-line learning of regular languages to the setting of timed systems. We consider systems that can be described by a class of determi...

Olga Grinchtein, Bengt Jonsson, Martin Leucker

claim paper

Read More »

126

Voted

KES
2004
Springer

165views Information Technology» more KES 2004»

Coordination in Multiagent Reinforcement Learning Systems

15 years 8 months ago

Download cig.ees.kyushu-u.ac.jp

This paper presents a novel method for on-line coordination in multiagent reinforcement learning systems. In this method a reinforcement-learning agent learns to select its action ...

M. A. S. Kamal, Junichi Murata

claim paper

Read More »

110

Voted

SOCRATES
2008

105views Education» more SOCRATES 2008»

Laboratory Door Opens to Non-formal Learning Communities. Science Centres as Mediators

15 years 4 months ago

Download sunsite.informatik.rwth-aachen.de

The e-KNOWNET is a Lifelong Learning project, which aims to develop an innovative and viable mechanism to facilitate the flow of new scientific knowledge produced in the research ...

G. Anyfandi, V. Laopodis, V. Koulaidis, Nicolas Ap...

claim paper

Read More »

132

click to vote

NIPS
2003

118views Information Technology» more NIPS 2003»

Learning Curves for Stochastic Gradient Descent in Linear Feedforward Networks

15 years 4 months ago

Download books.nips.cc

Gradient-following learning methods can encounter problems of implementation in many applications, and stochastic variants are frequently used to overcome these difﬁculties. We ...

Justin Werfel, Xiaohui Xie, H. Sebastian Seung

claim paper

Read More »

« Prev « First page 173 / 391 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers