Search Sciweavers | Sciweavers

682 search results - page 34 / 137

» One-Counter Markov Decision Processes


ICML
2010
IEEE

Machine Learning

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 7 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu



UAI
2003

Artificial Intelligence

Implementation and Comparison of Solution Methods for Decision Processes with Non-Markovian Rewards

15 years 7 months ago

Download users.cecs.anu.edu.au

This paper examines a number of solution methods for decision processes with non-Markovian rewards (NMRDPs). They all exploit a temporal logic specification of the reward functio...

Charles Gretton, David Price, Sylvie Thiéba...



AIED
2011
Springer

Artificial Intelligence

Faster Teaching by POMDP Planning

14 years 9 months ago

Download louisville.edu

Both human and automated tutors must infer what a student knows and plan future actions to maximize learning. Though substantial research has been done on tracking and modeling stu...

Anna N. Rafferty, Emma Brunskill, Thomas L. Griffi...



ICTAI
2007
IEEE

Artificial Intelligence

Multi-criteria Decision Making for Local Coordination in Multi-agent Systems

16 years 10 days ago

Download users.info.unicaen.fr

Unlike mono-agent systems, multi-agent planning addresses the problem of resolving conflicts between individual and group interests. In this paper, we are using a Decentralized Ve...

Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...



QEST
2010
IEEE

Modeling and Simulation

Symblicit Calculation of Long-Run Averages for Concurrent Probabilistic Systems

15 years 3 months ago

Download www.informatik.uni-freiburg.de

Model checkers for concurrent probabilistic systems have become very popular within the last decade. The study of long-run average behavior has however received only scan...

Ralf Wimmer, Bettina Braitling, Bernd Becker, Erns...


