Transition Probabilities

30

CDC
2009
IEEE

133views Control Systems» more CDC 2009»

Arbitrarily modulated Markov decision processes

14 years 2 months ago

— We consider decision-making problems in Markov decision processes where both the rewards and the transition probabilities vary in an arbitrary (e.g., nonstationary) fashion. We...

Jia Yuan Yu, Shie Mannor

claim paper

Read More »

37

click to vote

WEBDB
2010
Springer

155views Database» more WEBDB 2010»

Learning Topical Transition Probabilities in Click Through Data with Regression Models

14 years 2 months ago

Download webdb2010.org

The transition of search engine users’ intents has been studied for a long time. The knowledge of intent transition, once discovered, can yield a better understanding of how di�...

Xiao Zhang, Prasenjit Mitra

claim paper

Read More »

40

click to vote

AAMAS
2007
Springer

164views Intelligent Agents» more AAMAS 2007»

Networks of Learning Automata and Limiting Games

14 years 3 months ago

Download como.vub.ac.be

Learning Automata (LA) were recently shown to be valuable tools for designing Multi-Agent Reinforcement Learning algorithms. One of the principal contributions of LA theory is that...

Peter Vrancx, Katja Verbeeck, Ann Nowé

claim paper

Read More »

26

click to vote

ICDAR
2009
IEEE

161views Document Analysis» more ICDAR 2009»

Learning Rich Hidden Markov Models in Document Analysis: Table Location

14 years 4 months ago

Download homepages.inf.ed.ac.uk

Hidden Markov Models (HMM) are probabilistic graphical models for interdependent classification. In this paper we experiment with different ways of combining the components of an ...

Ana Costa e Silva

claim paper

Read More »

24

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

14 years 10 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers