Search Sciweavers | Sciweavers

» On-line Algorithms in Machine Learning

ICML 2009, IEEE

Herding dynamical weights to learn

Download www.ics.uci.edu

A new "herding" algorithm is proposed which directly converts observed moments into a sequence of pseudo-samples. The pseudo-samples respect the moment constraints and ma...

Max Welling

ML 2007, ACM

Learning deterministic context free grammars: The Omphalos competition

Download www.cs.rhul.ac.uk

This paper describes the winning entry to the Omphalos context-free grammar learning competition. Our approach integrates an information-theoretic constituent likelihood measure to...

Alexander Clark

ICML 2006, IEEE

Maximum margin planning

Download www.cs.cmu.edu

Mobile robots often rely upon systems that render sensor data and perceptual features into costs that can be used in a planner. The behavior that a designer wishes the planner to ...

Nathan D. Ratliff, J. Andrew Bagnell, Martin Zinke...

Publication

Rollout Sampling Approximate Policy Iteration

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

ECML 2007, Springer

Safe Q-Learning on Complete History Spaces

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller
