Search Sciweavers | Sciweavers

210 search results - page 32 / 42

» An analysis of reinforcement learning with function approxim...

159

click to vote

RSKT
2009
Springer

136views Information Technology» more RSKT 2009»

Learning Optimal Parameters in Decision-Theoretic Rough Sets

15 years 11 months ago

Download www2.cs.uregina.ca

A game-theoretic approach for learning optimal parameter values for probabilistic rough set regions is presented. The parameters can be used to deﬁne approximation regions in a p...

Joseph P. Herbert, Jingtao Yao

claim paper

Read More »

159

click to vote

ML
2002
ACM

168views Machine Learning» more ML 2002»

On Average Versus Discounted Reward Temporal-Difference Learning

15 years 4 months ago

Download web.mit.edu

We provide an analytical comparison between discounted and average reward temporal-difference (TD) learning with linearly parameterized approximations. We first consider the asympt...

John N. Tsitsiklis, Benjamin Van Roy

claim paper

Read More »

135

click to vote

ALT
2009
Springer

143views Machine Learning» more ALT 2009»

Approximation Algorithms for Tensor Clustering

16 years 1 months ago

Download www-users.cs.umn.edu

Abstract. We present the ﬁrst (to our knowledge) approximation algorithm for tensor clustering—a powerful generalization to basic 1D clustering. Tensors are increasingly common...

Stefanie Jegelka, Suvrit Sra, Arindam Banerjee

claim paper

Read More »

160

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 5 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

148

click to vote

EC
2006

121views ECommerce» more EC 2006»

A Study of Structural and Parametric Learning in XCS

15 years 5 months ago

Download www.cs.bris.ac.uk

The performance of a learning classifier system is due to its two main components. First, it evolves new structures by generating new rules in a genetic process; second, it adjust...

Tim Kovacs, Manfred Kerber

claim paper

Read More »

« Prev « First page 32 / 42 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers