Sciweavers

JAIR
2008
119views more  JAIR 2008»
13 years 4 months ago
A Multiagent Reinforcement Learning Algorithm with Non-linear Dynamics
Several multiagent reinforcement learning (MARL) algorithms have been proposed to optimize agents' decisions. Due to the complexity of the problem, the majority of the previo...
Sherief Abdallah, Victor R. Lesser
JAIR
2008
148views more  JAIR 2008»
13 years 4 months ago
Learning Partially Observable Deterministic Action Models
We present exact algorithms for identifying deterministic-actions' effects and preconditions in dynamic partially observable domains. They apply when one does not know the ac...
Eyal Amir, Allen Chang
JAIR
2008
110views more  JAIR 2008»
13 years 4 months ago
Completeness and Performance Of The APO Algorithm
Asynchronous Partial Overlay (APO) is a search algorithm that uses cooperative mediation to solve Distributed Constraint Satisfaction Problems (DisCSPs). The algorithm partitions ...
Tal Grinshpoun, Amnon Meisels
JAIR
2008
173views more  JAIR 2008»
13 years 4 months ago
Computational Logic Foundations of KGP Agents
This paper presents the computational logic foundations of a model of agency called the KGP (Knowledge, Goals and Plan) model. This model allows the specification of heterogeneous...
Antonis C. Kakas, Paolo Mancarella, Fariba Sadri, ...
JAIR
2008
93views more  JAIR 2008»
13 years 4 months ago
Dynamic Control in Real-Time Heuristic Search
Vadim Bulitko, Mitja Lustrek, Jonathan Schaeffer, ...
JAIR
2008
126views more  JAIR 2008»
13 years 4 months ago
Cooperative Search with Concurrent Interactions
In this paper we show how taking advantage of autonomous agents' capability to maintain parallel interactions with others, and incorporating it into the cooperative economic ...
Efrat Manisterski, David Sarne, Sarit Kraus
JAIR
2008
104views more  JAIR 2008»
13 years 4 months ago
M-DPOP: Faithful Distributed Implementation of Efficient Social Choice Problems
In the efficient social choice problem, the goal is to assign values, subject to side constraints, to a set of variables to maximize the total utility across a population of agent...
Adrian Petcu, Boi Faltings, David C. Parkes
JAIR
2008
130views more  JAIR 2008»
13 years 4 months ago
Online Planning Algorithms for POMDPs
Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...
Stéphane Ross, Joelle Pineau, Sébast...
JAIR
2008
104views more  JAIR 2008»
13 years 4 months ago
On the Qualitative Comparison of Decisions Having Positive and Negative Features
Making a decision is often a matter of listing and comparing positive and negative arguments. In such cases, the evaluation scale for decisions should be considered bipolar, that ...
Didier Dubois, Hélène Fargier, Jean-...