Search Sciweavers | Sciweavers

43 search results - page 9 / 9

» A Game of Prediction with Expert Advice

click to vote

JMLR
2010

103views more JMLR 2010»

Regret Bounds and Minimax Policies under Partial Monitoring

12 years 11 months ago

Download jmlr.csail.mit.edu

This work deals with four classical prediction settings, namely full information, bandit, label efficient and bandit label efficient as well as four different notions of regret: p...

Jean-Yves Audibert, Sébastien Bubeck

claim paper

Read More »

click to vote

SIGECOM
2006
ACM

128views ECommerce» more SIGECOM 2006»

Controlling a supply chain agent using value-based decomposition

13 years 10 months ago

Download ai.eecs.umich.edu

We present and evaluate the design of Deep Maize, our entry in the 2005 Trading Agent Competition Supply Chain Management scenario. The central idea is to decompose the problem by...

Christopher Kiekintveld, Jason Miller, Patrick R. ...

claim paper

Read More »

click to vote

LAMAS
2005
Springer

168views Intelligent Agents» more LAMAS 2005»

Multi-agent Relational Reinforcement Learning

13 years 10 months ago

Download dtai.cs.kuleuven.be

In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...

Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...

claim paper

Read More »

« Prev « First page 9 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers