Search Sciweavers | Sciweavers

7342 search results - page 906 / 1469

» Optimal Language Learning

159

click to vote

EWRL
2008

144views Machine Learning» more EWRL 2008»

Regularized Fitted Q-Iteration: Application to Planning

15 years 8 months ago

Download eprints.pascal-network.org

We consider planning in a Markovian decision problem, i.e., the problem of finding a good policy given access to a generative model of the environment. We propose to use fitted Q-i...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

142

click to vote

ALT
2010
Springer

136views Machine Learning» more ALT 2010»

Consistency of Feature Markov Processes

15 years 8 months ago

Download www.hutter1.net

We are studying long term sequence prediction (forecasting). We approach this by investigating criteria for choosing a compact useful state representation. The state is supposed t...

Peter Sunehag, Marcus Hutter

claim paper

Read More »

152

click to vote

ESANN
2007

219views Neural Networks» more ESANN 2007»

Agglomerative Independent Variable Group Analysis

15 years 8 months ago

Download www.cis.hut.fi

Independent Variable Group Analysis (IVGA) is a method for grouping dependent variables together while keeping mutually independent or weakly dependent variables in separate group...

Antti Honkela, Jeremias Seppä, Esa Alhoniemi

claim paper

Read More »

193

click to vote

UAI
2008

236views Artificial Intelligence» more UAI 2008»

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

15 years 8 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, realworld domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...

claim paper

Read More »

154

click to vote

NIPS
2003

137views Information Technology» more NIPS 2003»

Online Passive-Aggressive Algorithms

15 years 7 months ago

Download jmlr.csail.mit.edu

We present a family of margin based online learning algorithms for various prediction tasks. In particular we derive and analyze algorithms for binary and multiclass categorizatio...

Shai Shalev-Shwartz, Koby Crammer, Ofer Dekel, Yor...

claim paper

Read More »

« Prev « First page 906 / 1469 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers