Sciweavers

1753 search results - page 176 / 351
» State Machines
Sort
View
ICML
2000
IEEE
16 years 5 months ago
Convergence Problems of General-Sum Multiagent Reinforcement Learning
Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...
Michael H. Bowling
ICML
2000
IEEE
16 years 5 months ago
Learning Probabilistic Models for Decision-Theoretic Navigation of Mobile Robots
Decision-theoretic reasoning and planning algorithms are increasingly being used for mobile robot navigation, due to the signi cant uncertainty accompanying the robots' perce...
Daniel Nikovski, Illah R. Nourbakhsh
TPHOL
2007
IEEE
15 years 11 months ago
Operational Reasoning for Concurrent Caml Programs and Weak Memory Models
This paper concerns the formal semantics of programming languages, and the specification and verification of software. We are interested in the verification of real programs, wr...
Tom Ridge
106
Voted
ECML
2007
Springer
15 years 11 months ago
Safe Q-Learning on Complete History Spaces
In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...
Stephan Timmer, Martin Riedmiller
ICALT
2005
IEEE
15 years 10 months ago
The Use of an Adaptive Hypermedia Learning System to Support a New Pedagogical Model
The purpose of this paper is to present the current state and future development of the PLATINEA project. This project allows students and teachers to create and to consolidate kn...
Constatino Martins, Isabel Azevedo, Carlos Vaz de ...