Search Sciweavers | Sciweavers

153

MDAI
2005
Springer

138views Artificial Intelligence» more MDAI 2005»

Perceptive Evaluation for the Optimal Discounted Reward in Markov Decision Processes

15 years 11 months ago

We formulate a fuzzy perceptive model for Markov decision processes with discounted payoﬀ in which the perception for transition probabilities is described by fuzzy sets. Our aim...

Masami Kurano, Masami Yasuda, Jun-ichi Nakagami, Y...

claim paper

Read More »

183

click to vote

ICFEM
2004
Springer

172views Software Engineering» more ICFEM 2004»

Linear Inequality LTL (iLTL): A Model Checker for Discrete Time Markov Chains

15 years 11 months ago

Download osl.cs.uiuc.edu

Abstract. We develop a way of analyzing the behavior of systems modeled using Discrete Time Markov Chains (DTMC). Speciﬁcally, we deﬁne iLTL, an LTL with linear inequalities on...

YoungMin Kwon, Gul Agha

claim paper

Read More »

180

click to vote

SIGSOFT
2007
ACM

198views Software Engineering» more SIGSOFT 2007»

Quantitative verification: models techniques and tools

16 years 6 months ago

Download qav.comlab.ox.ac.uk

Automated verification is a technique for establishing if certain properties, usually expressed in temporal logic, hold for a system model. The model can be defined using a high-l...

Marta Z. Kwiatkowska

claim paper

Read More »

189

click to vote

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 11 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

158

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

16 years 6 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers