Sciweavers

1325 search results - page 116 / 265
» Algorithm Selection using Reinforcement Learning
Sort
View
114
Voted
ICML
2009
IEEE
16 years 3 months ago
Near-Bayesian exploration in polynomial time
We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...
J. Zico Kolter, Andrew Y. Ng
128
Voted
GECCO
2006
Springer
138views Optimization» more  GECCO 2006»
15 years 6 months ago
Does overfitting affect performance in estimation of distribution algorithms
Estimation of Distribution Algorithms (EDAs) are a class of evolutionary algorithms that use machine learning techniques to solve optimization problems. Machine learning is used t...
Hao Wu, Jonathan L. Shapiro
132
Voted
ATAL
2004
Springer
15 years 8 months ago
Adaptive, Distributed Control of Constrained Multi-Agent Systems
Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distribu...
Stefan Bieniawski, David Wolpert
132
Voted
PE
2011
Springer
215views Optimization» more  PE 2011»
14 years 9 months ago
Energy-aware routing in the Cognitive Packet Network
An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is specified for the ...
Toktam Mahmoodi
105
Voted
FSS
2006
114views more  FSS 2006»
15 years 2 months ago
Fuzzy logic based variable step size algorithm for blind delayed source separation
Convergence of blind delayed source separation algorithms, which use constant learning rates, is known to be slow. We propose a fuzzy logic based approach to adaptively select the...
Vivek Nigam, Roland Priemer