Search Sciweavers | Sciweavers

40 search results - page 3 / 8

» Parametric regret in uncertain Markov decision processes

click to vote

NIPS
2007

146views Information Technology» more NIPS 2007»

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

13 years 7 months ago

Download books.nips.cc

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett

claim paper

Read More »

click to vote

ANOR
2004

143views more ANOR 2004»

Model Independent Parametric Decision Making

13 years 5 months ago

Download sol.rutgers.edu

Accurate knowledge of the effect of parameter uncertainty on process design and operation is essential for optimal and feasible operation of a process plant. Existing approaches de...

Ipsita Banerjee, Marianthi G. Ierapetritou

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

13 years 4 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

click to vote

CORR
2011
Springer

202views Education» more CORR 2011»

Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems

13 years 17 days ago

Download www.ualberta.ca

The analysis of online least squares estimation is at the heart of many stochastic sequential decision-making problems. We employ tools from the self-normalized processes to provi...

Yasin Abbasi-Yadkori, Dávid Pál, Csa...

claim paper

Read More »

click to vote

ATAL
2004
Springer

88views Intelligent Agents» more ATAL 2004»

Interactive POMDPs: Properties and Preliminary Results

13 years 11 months ago

Download dali.ai.uic.edu

This paper presents properties and results of a new framework for sequential decision-making in multiagent settings called interactive partially observable Markov decision process...

Piotr J. Gmytrasiewicz, Prashant Doshi

claim paper

Read More »

« Prev « First page 3 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers