Sciweavers

168 search results - page 28 / 34
» Optimism in Reinforcement Learning Based on Kullback-Leibler...
Sort
View
GECCO
2008
Springer
128views Optimization» more  GECCO 2008»
14 years 10 months ago
Adapted Pittsburgh classifier system: building accurate strategies in non markovian environments
This paper focuses on the study of the behavior of a genetic algorithm based classifier system, the Adapted Pittsburgh Classifier System (A.P.C.S), on maze type environments con...
Gilles Énée, Mathias Péroumal...
AAAI
2008
14 years 11 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
PE
2011
Springer
215views Optimization» more  PE 2011»
14 years 4 months ago
Energy-aware routing in the Cognitive Packet Network
An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is specified for the ...
Toktam Mahmoodi
EPIA
2007
Springer
15 years 3 months ago
Intelligent Farmer Agent for Multi-agent Ecological Simulations Optimization
Abstract. This paper presents the development of a bivalve farmer agent interacting with a realistic ecological simulation system. The purpose of the farmer agent is to determine t...
Filipe Cruz, António Pereira, Pedro Valente...
UAI
2003
14 years 10 months ago
On the Convergence of Bound Optimization Algorithms
Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...
Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...