Sciweavers

5075 search results - page 183 / 1015
» Convergence
Sort
View
SIGECOM
2009
ACM
114views ECommerce» more  SIGECOM 2009»
15 years 11 months ago
Policy teaching through reward function learning
Policy teaching considers a Markov Decision Process setting in which an interested party aims to influence an agent’s decisions by providing limited incentives. In this paper, ...
Haoqi Zhang, David C. Parkes, Yiling Chen
CEC
2007
IEEE
15 years 11 months ago
How well do multi-objective evolutionary algorithms scale to large problems
Abstract— In spite of large amount of research work in multiobjective evolutionary algorithms, most have evaluated their algorithms on problems with only two to four objectives. ...
Kata Praditwong, Xin Yao
VTC
2007
IEEE
135views Communications» more  VTC 2007»
15 years 10 months ago
MBER Turbo Multiuser Beamforming Aided QPSK Receiver Design Using EXIT Chart Analysis
Abstract— This paper studies the mutual information transfer characteristics of a novel iterative soft interference cancellation (SIC) aided beamforming receiver designed for qua...
Shuang Tan, Sheng Chen, Lajos Hanzo
AAMAS
2007
Springer
15 years 10 months ago
Continuous-State Reinforcement Learning with Fuzzy Approximation
Abstract. Reinforcement learning (RL) is a widely used learning paradigm for adaptive agents. There exist several convergent and consistent RL algorithms which have been intensivel...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
122
Voted
NETCOOP
2007
Springer
15 years 10 months ago
The Practical Performance of Subgradient Computational Techniques for Mesh Network Utility Optimization
In the networking research literature, the problem of network utility optimization is often converted to the dual problem which, due to nondifferentiability, is solved with a part...
Peng Wang, Stephan Bohacek