Search Sciweavers | Sciweavers

473 search results - page 82 / 95

» Optimal policy switching algorithms for reinforcement learni...

113

click to vote

COLT
2010
Springer

191views Machine Learning» more COLT 2010»

Best Arm Identification in Multi-Armed Bandits

14 years 9 months ago

Download www.di.ens.fr

We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...

Jean-Yves Audibert, Sébastien Bubeck, R&eac...

claim paper

Read More »

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

15 years 6 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

110

click to vote

PE
2011
Springer

215views Optimization» more PE 2011»

Energy-aware routing in the Cognitive Packet Network

14 years 6 months ago

Download san.ee.ic.ac.uk

An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is speciﬁed for the �...

Toktam Mahmoodi

claim paper

Read More »

click to vote

EMO
2005
Springer

107views Optimization» more EMO 2005»

Multiobjective Water Pinch Analysis of the Cuernavaca City Water Distribution Network

15 years 5 months ago

Download ccc.inaoep.mx

Water systems often allow eﬃcient water uses via water reuse and/or recirculation. Deﬁning the network layout connecting water-using processes is a complex problem which involv...

Carlos E. Mariano-Romero, Víctor Alcocer-Ya...

claim paper

Read More »

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

15 years 6 months ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

« Prev « First page 82 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers