Search Sciweavers | Sciweavers

513 search results - page 98 / 103

» Metric learning for reinforcement learning agents

149

click to vote

KDD
2010
ACM

289views Data Mining» more KDD 2010»

Exploitation and exploration in a performance based contextual advertising system

14 years 10 months ago

Download www.cs.umass.edu

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...

Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...

claim paper

Read More »

106

click to vote

SASO
2009
IEEE

144views Control Systems» more SASO 2009»

Self-organizing Bandwidth Sharing in Priority-Based Medium Access

15 years 7 months ago

Download www12.informatik.uni-erlangen.de

In this paper, we present an analysis of self-organizing bandwidth sharing in priority-based medium access. For this purpose, the priority-based Access Game is introduced. Analysi...

Stefan Wildermann, Tobias Ziermann, Jürgen Te...

claim paper

Read More »

Voted

AAAI
2006

118views Intelligent Agents» more AAAI 2006»

Hard Constrained Semi-Markov Decision Processes

15 years 1 months ago

Download www.aaai.org

In multiple criteria Markov Decision Processes (MDP) where multiple costs are incurred at every decision point, current methods solve them by minimising the expected primary cost ...

Wai-Leong Yeow, Chen-Khong Tham, Wai-Choong Wong

claim paper

Read More »

114

click to vote

WETICE
2000
IEEE

135views Emerging Technology» more WETICE 2000»

Evaluation Challenges for a Federation of Heterogeneous Information Providers: The Case of NASA's Earth Science Information Part

15 years 4 months ago

Download www.landcover.org

NASA’s Earth Science Information Partnership Federation is an experiment funded to assess the ability of a group of widely heterogeneous earth science data or service providers ...

Catherine Plaisant, Anita Komlodi, Francis Lindsay

claim paper

Read More »

128

click to vote

ATAL
2011
Springer

220views Intelligent Agents» more ATAL 2011»

Using iterated reasoning to predict opponent strategies

14 years 13 days ago

Download paul.rutgers.edu

The ﬁeld of multiagent decision making is extending its tools from classical game theory by embracing reinforcement learning, statistical analysis, and opponent modeling. For ex...

Michael Wunder, Michael Kaisers, John Robert Yaros...

claim paper

Read More »

« Prev « First page 98 / 103 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers