Sciweavers

2747 search results - page 364 / 550
» Non-oblivious Strategy Improvement
Sort
View
ATAL
2006
Springer
15 years 3 months ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
COOPIS
2004
IEEE
15 years 3 months ago
Dynamic Adaptation of Data Distribution Policies in a Shared Data Space System
Increasing demands for interconnectivity, adaptivity and flexibility are leading to distributed component-based systems (DCBS) where components may dynamically join and leave a sys...
Giovanni Russello, Michel R. V. Chaudron, Maarten ...
EGICE
2006
15 years 3 months ago
Combining Two Data Mining Methods for System Identification
System identification is an abductive task which is affected by several kinds of modeling assumptions and measurement errors. Therefore, instead of optimizing values of parameters ...
Sandro Saitta, Benny Raphael, Ian F. C. Smith
EUROPAR
2006
Springer
15 years 3 months ago
Creating and Maintaining Replicas in Unstructured Peer-to-Peer Systems
Abstract. In peer-to-peer systems, replication is an important issue as it improves search performance and data availability. It has been shown that optimal replication is attained...
Elias Leontiadis, Vassilios V. Dimakopoulos, Evagg...
ECIR
2010
Springer
15 years 22 days ago
Using the Quantum Probability Ranking Principle to Rank Interdependent Documents
A known limitation of the Probability Ranking Principle (PRP) is that it does not cater for dependence between documents. Recently, the Quantum Probability Ranking Principle (QPRP)...
Guido Zuccon, Leif Azzopardi