Sciweavers

1042 search results - page 74 / 209
» Failing First: An Update
Sort
View
114
Voted
AAAI
2010
15 years 2 months ago
Relative Entropy Policy Search
Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...
Jan Peters, Katharina Mülling, Yasemin Altun
90
Voted
COMAD
2008
15 years 2 months ago
The Efficient Maintenance of Access Roles with Role Hiding
Role-based access control (RBAC) has attracted considerable research interest. However, the computational issues of RBAC models are yet to be thoroughly studied. In this paper, we...
Chaoyi Pang, Xiuzhen Zhang, Yanchun Zhang, Kotagir...
MVA
2007
138views Computer Vision» more  MVA 2007»
15 years 2 months ago
Vehicle Tracking Using Image Alignment and Haar Transform
The large number of rear end collisions due to driver inattention has been identified as a major automotive safety issue. In this paper, we describe a 3-phase vehicle tracking met...
Lap-Chi Cheung, Yiu Sang Moon
107
Voted
FLAIRS
2004
15 years 2 months ago
Interactive Refinement of a Knowledge Base
This paper presents a new method of interactive refinement of a knowledge base. The first step of our method is a validation stage which checks the consistency and the completenes...
R. Djelouah, Béatrice Duval, Stéphan...
109
Voted
ICML
2010
IEEE
15 years 1 months ago
Multi-agent Learning Experiments on Repeated Matrix Games
This paper experimentally evaluates multiagent learning algorithms playing repeated matrix games to maximize their cumulative return. Previous works assessed that Qlearning surpas...
Bruno Bouzy, Marc Métivier