Sciweavers

» Can We Learn to Beat the Best Stock
LAMAS
2005
Springer
15 years 3 months ago
Unifying Convergence and No-Regret in Multiagent Learning
We present a new multiagent learning algorithm, RVσ(t), that builds on an earlier version, ReDVaLeR. ReDVaLeR could guarantee (a) convergence to best response against stationary ...
Bikramjit Banerjee, Jing Peng
ICML
2007
IEEE
15 years 10 months ago
On the role of tracking in stationary environments
It is often thought that learning algorithms that track the best solution, as opposed to converging to it, are important only on nonstationary problems. We present three results s...
Richard S. Sutton, Anna Koop, David Silver
ICML
2010
IEEE
14 years 10 months ago
Internal Rewards Mitigate Agent Boundedness
Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals, however they came to be defined, while ignoring the rela...
Jonathan Sorg, Satinder P. Singh, Richard Lewis
ACL
2006
14 years 11 months ago
Using Machine-Learning to Assign Function Labels to Parser Output for Spanish
Data-driven grammatical function tag assignment has been studied for English using the Penn-II Treebank data. In this paper we address the question of whether such methods can be ...
Grzegorz Chrupala, Josef van Genabith
ATAL
2004
Springer
15 years 3 months ago
Learning User Preferences for Wireless Services Provisioning
The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...
George Lee, Steven Bauer, Peyman Faratin, John Wro...