Sciweavers

1631 search results - page 158 / 327
» Observing Functional Logic Computations
Sort
View
CDC
2009
IEEE
132views Control Systems» more  CDC 2009»
15 years 8 months ago
Q-learning and Pontryagin's Minimum Principle
Abstract— Q-learning is a technique used to compute an optimal policy for a controlled Markov chain based on observations of the system controlled using a non-optimal policy. It ...
Prashant G. Mehta, Sean P. Meyn
AIPS
2008
15 years 6 months ago
Multiagent Planning Under Uncertainty with Stochastic Communication Delays
We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...
Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...
IADIS
2003
15 years 5 months ago
Understanding Expert Search Strategies for Designing User-Friendly Search Interfaces
Web search engines face an extremely heterogeneous user population from web novices to highly skilled experts. Currently, the search strategies of the experienced web searchers ar...
Anne Aula, Mika Käki
AAAI
2012
13 years 6 months ago
Tree-Based Solution Methods for Multiagent POMDPs with Delayed Communication
Planning under uncertainty is an important and challenging problem in multiagent systems. Multiagent Partially Observable Markov Decision Processes (MPOMDPs) provide a powerful fr...
Frans Adriaan Oliehoek, Matthijs T. J. Spaan
COMMA
2008
15 years 5 months ago
Requirements for reflective argument visualization tools: A Case for Using Validity as a Normative Standard
This paper formulates in the first part some requirements for a certain sort of computational argumentation systems, namely those which are designed for a very specific purpose: to...
Michael H. G. Hoffmann