Sciweavers

513 search results - page 98 / 103
» Metric learning for reinforcement learning agents
Sort
View
KDD
2010
ACM
289views Data Mining» more  KDD 2010»
14 years 10 months ago
Exploitation and exploration in a performance based contextual advertising system
The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...
Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...
SASO
2009
IEEE
15 years 7 months ago
Self-organizing Bandwidth Sharing in Priority-Based Medium Access
In this paper, we present an analysis of self-organizing bandwidth sharing in priority-based medium access. For this purpose, the priority-based Access Game is introduced. Analysi...
Stefan Wildermann, Tobias Ziermann, Jürgen Te...
78
Voted
AAAI
2006
15 years 1 months ago
Hard Constrained Semi-Markov Decision Processes
In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...
Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong
WETICE
2000
IEEE
15 years 4 months ago
Evaluation Challenges for a Federation of Heterogeneous Information Providers: The Case of NASA's Earth Science Information Part
NASA’s Earth Science Information Partnership Federation is an experiment funded to assess the ability of a group of widely heterogeneous earth science data or service providers ...
Catherine Plaisant, Anita Komlodi, Francis Lindsay
ATAL
2011
Springer
14 years 13 days ago
Using iterated reasoning to predict opponent strategies
The field of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For ex...
Michael Wunder, Michael Kaisers, John Robert Yaros...