Search Sciweavers | Sciweavers

12 search results - page 3 / 3

» Finite-time Analysis of the Multiarmed Bandit Problem

click to vote

MANSCI
2007

100views more MANSCI 2007»

Dynamic Assortment with Demand Learning for Seasonal Consumer Goods

13 years 4 months ago

Download web.mit.edu

Companies such as Zara and World Co. have recently implemented novel product development processes and supply chain architectures enabling them to make more product design and ass...

Felipe Caro, Jérémie Gallien

claim paper

Read More »

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

13 years 11 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers