Sciweavers

949 search results - page 56 / 190
» Can Doxastic Agents Learn
Sort
View
84
Voted
AAAI
1997
15 years 2 months ago
Reinforcement Learning with Time
This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...
Daishi Harada
CEEMAS
2005
Springer
15 years 6 months ago
A Direct Reputation Model for VO Formation
We show that reputation is a basic ingredient in the Virtual Organisation (VO) formation process. Agents can use their experiences gained in direct past interactions to model other...
Arturo Avila-Rosas, Michael Luck
94
Voted
ATAL
2005
Springer
15 years 6 months ago
Modeling opponent decision in repeated one-shot negotiations
In many negotiation and bargaining scenarios, a particular agent may need to interact repeatedly with another agent. Typically, these interactions take place under incomplete info...
Sabyasachi Saha, Anish Biswas, Sandip Sen
88
Voted
DIS
2006
Springer
15 years 4 months ago
A Pragmatic Logic of Scientific Discovery
Abstract. To the best of our knowledge, this paper is the first attempt to formalise a pragmatic logic of scientific discovery in a manner such that it can be realised by scientist...
Jean Sallantin, Christopher Dartnell, Mohammad Afs...
ICML
1997
IEEE
16 years 1 months ago
Hierarchical Explanation-Based Reinforcement Learning
Explanation-Based Reinforcement Learning (EBRL) was introduced by Dietterich and Flann as a way of combining the ability of Reinforcement Learning (RL) to learn optimal plans with...
Prasad Tadepalli, Thomas G. Dietterich