Sciweavers

» Exploiting Online Games
SIAMCOMP
2002
15 years 9 days ago
The Nonstochastic Multiarmed Bandit Problem
Abstract. In the multiarmed bandit problem, a gambler must decide which arm of K nonidentical slot machines to play in a sequence of trials so as to maximize his reward. This class...
Peter Auer, Nicolò Cesa-Bianchi, Yoav Freun...
ATAL
2007
Springer
15 years 6 months ago
Advice taking in multiagent reinforcement learning
This paper proposes the β-WoLF algorithm for multiagent reinforcement learning (MARL) in the stochastic games framework that uses an additional “advice” signal to inform agen...
Michael Rovatsos, Alexandros Belesiotis
PLDI
2012
ACM
13 years 3 months ago
Effective parallelization of loops in the presence of I/O operations
Software-based thread-level parallelization has been widely studied for exploiting data parallelism in purely computational loops to improve program performance on multiprocessors...
Min Feng, Rajiv Gupta, Iulian Neamtiu
AAAI
2006
15 years 2 months ago
Action Selection in Bayesian Reinforcement Learning
My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...
Tao Wang
SIGMOD
2004
ACM
16 years 24 days ago
Diamond in the Rough: Finding Hierarchical Heavy Hitters in Multi-Dimensional Data
Data items archived in data warehouses or those that arrive online as streams typically have attributes which take values from multiple hierarchies (e.g., time and geographic loca...
Graham Cormode, Flip Korn, S. Muthukrishnan, Dives...