Search Sciweavers | Sciweavers

166 search results - page 12 / 34

» Online model learning in adversarial Markov decision process...

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

15 years 5 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 4 days ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

14 years 10 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

116

click to vote

CORR
2010
Springer

171views Education» more CORR 2010»

Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach

14 years 6 months ago

Download www.eecs.umich.edu

We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...

Cem Tekin, Mingyan Liu

claim paper

Read More »

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 5 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

« Prev « First page 12 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers