Sciweavers

651 search results - page 113 / 131
» Algorithms for Inverse Reinforcement Learning
Sort
View
PE
2011
Springer
215views Optimization» more  PE 2011»
14 years 4 months ago
Energy-aware routing in the Cognitive Packet Network
An energy aware routing protocol (EARP) is proposed to minimise a performance metric that combines the total consumed power in the network and the QoS that is specified for the ...
Toktam Mahmoodi
NIPS
1998
14 years 11 months ago
Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms
In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...
Michael J. Kearns, Satinder P. Singh
68
Voted
WWW
2006
ACM
15 years 10 months ago
Finding advertising keywords on web pages
A large and growing number of web pages display contextual advertising based on keywords automatically extracted from the text of the page, and this is a substantial source of rev...
Wen-tau Yih, Joshua Goodman, Vitor R. Carvalho
78
Voted
AAMAS
2010
Springer
14 years 9 months ago
Coordinated learning in multiagent MDPs with infinite state-space
Abstract In this paper we address the problem of simultaneous learning and coordination in multiagent Markov decision problems (MMDPs) with infinite state-spaces. We separate this ...
Francisco S. Melo, M. Isabel Ribeiro
GECCO
2006
Springer
195views Optimization» more  GECCO 2006»
15 years 1 months ago
Studying XCS/BOA learning in Boolean functions: structure encoding and random Boolean functions
Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, conseque...
Martin V. Butz, Martin Pelikan