Search Sciweavers | Sciweavers

1325 search results - page 116 / 265

» Algorithm Selection using Reinforcement Learning

164

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

16 years 7 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

183

click to vote

GECCO
2006
Springer

138views Optimization» more GECCO 2006»

Does overfitting affect performance in estimation of distribution algorithms

15 years 10 months ago

Download www.cs.york.ac.uk

Estimation of Distribution Algorithms (EDAs) are a class of evolutionary algorithms that use machine learning techniques to solve optimization problems. Machine learning is used t...

Hao Wu, Jonathan L. Shapiro

claim paper

Read More »

180

click to vote

ATAL
2004
Springer

197views Intelligent Agents» more ATAL 2004»

Adaptive, Distributed Control of Constrained Multi-Agent Systems

16 years 1 days ago

Download collectives.stanford.edu

Product Distribution (PD) theory was recently developed as a framework for analyzing and optimizing distributed systems. In this paper we demonstrate its use for adaptive distribu...

Stefan Bieniawski, David Wolpert

claim paper

Read More »

194

click to vote

PE
2011
Springer

215views Optimization» more PE 2011»

Energy-aware routing in the Cognitive Packet Network

15 years 1 months ago

Download san.ee.ic.ac.uk

An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is speciﬁed for the �...

Toktam Mahmoodi

claim paper

Read More »

158

click to vote

FSS
2006

114views more FSS 2006»

Fuzzy logic based variable step size algorithm for blind delayed source separation

15 years 6 months ago

Download www.ece.uic.edu

Convergence of blind delayed source separation algorithms, which use constant learning rates, is known to be slow. We propose a fuzzy logic based approach to adaptively select the...

Vivek Nigam, Roland Priemer

claim paper

Read More »

« Prev « First page 116 / 265 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers