Search Sciweavers | Sciweavers

132 search results - page 20 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

COLT
2007
Springer

118 views, Machine Learning

Learning Large-Alphabet and Analog Circuits with Value Injection Queries

16 years 1 month ago

Download www.cs.yale.edu

Abstract. We consider the problem of learning an acyclic discrete circuit with n wires, fan-in bounded by k and alphabet size s using value injection queries. For the class of tran...

Dana Angluin, James Aspnes, Jiang Chen, Lev Reyzin

ICML
2006
ACM

131 views, Machine Learning

Relational temporal difference learning

16 years 8 months ago

Download cll.stanford.edu

We introduce relational temporal difference learning as an effective approach to solving multi-agent Markov decision problems with large state spaces. Our algorithm uses temporal ...

Nima Asgharbeygi, David J. Stracuzzi, Pat Langley

TNN
2010

148 views, Management

Generalized low-rank approximations of matrices revisited

15 years 2 months ago

Download parnec.nuaa.edu.cn

Compared to Singular Value Decomposition (SVD), Generalized Low Rank Approximations of Matrices (GLRAM) can consume less computation time, obtain higher compression ratio, and yiel...

Jun Liu, Songcan Chen, Zhi-Hua Zhou, Xiaoyang Tan

GECCO
2006
Springer

195 views, Optimization

Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions

15 years 11 months ago

Download www.coboslab.psychologie.uni-wuerzburg.de

Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...

Martin V. Butz, Martin Pelikan

JCP
2007

143 views

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 7 months ago

Download www.academypublisher.com

Abstract. We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio
