Search Sciweavers | Sciweavers

122

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 8 months ago

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

110

click to vote

STACS
1997
Springer

137views Theoretical Computer Science» more STACS 1997»

Methods and Applications of (MAX, +) Linear Algebra

15 years 5 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus

claim paper

Read More »

120

click to vote

ATAL
2007
Springer

145views Intelligent Agents» more ATAL 2007»

Interactive dynamic influence diagrams

15 years 5 months ago

Download www.sci.brooklyn.cuny.edu

This paper extends the framework of dynamic influence diagrams (DIDs) to the multi-agent setting. DIDs are computational representations of the Partially Observable Markov Decisio...

Kyle Polich, Piotr J. Gmytrasiewicz

claim paper

Read More »

106

click to vote

AI
2006
Springer

110views Artificial Intelligence» more AI 2006»

An Efficient Resource Allocation Approach in Real-Time Stochastic Environment

15 years 5 months ago

Download www.damas.ift.ulaval.ca

We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...

Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...

claim paper

Read More »

115

click to vote

CORR
2011
Springer

175views Education» more CORR 2011»

Adaptive Channel Recommendation for Dynamic Spectrum Access

14 years 8 months ago

Download home.ie.cuhk.edu.hk

—We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average rewa...

Xu Chen, Jianwei Huang, Husheng Li

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers