Search Sciweavers | Sciweavers

10054 search results - page 112 / 2011

» On the Complexity of Function Learning

ATAL
2005
Springer

Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

15 years 8 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

ICML
2001
IEEE

Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

16 years 4 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

ESWA
2006

Model gene network by semi-fixed Bayesian network

15 years 3 months ago

Download www.comp.nus.edu.sg

Gene networks describe functional pathways in a given cell or tissue, representing processes such as metabolism, gene expression regulation, and protein or RNA transport. Thus, le...

Tie-Fei Liu, Wing-Kin Sung, Ankush Mittal

GECCO
2007
Springer

Optimization

ICSPEA: evolutionary five-axis milling path optimisation

15 years 9 months ago

Download www.cs.bham.ac.uk

ICSPEA is a novel multi-objective evolutionary algorithm which integrates aspects from the powerful variation operators of the Covariance Matrix Adaptation Evolution Strategy (CMA...

Jörn Mehnen, Rajkumar Roy, Petra Kersting, To...

COLT
2000
Springer

Machine Learning

Model Selection and Error Estimation

15 years 7 months ago

Download www.stanford.edu

We study model selection strategies based on penalized empirical loss minimization. We point out a tight relationship between error estimation and data-based complexity penalizatio...

Peter L. Bartlett, Stéphane Boucheron, G...
