Sciweavers

945 search results - page 142 / 189
» Dialog Convergence and Learning
Sort
View
COLT
1995
Springer
15 years 1 months ago
A Comparison of New and Old Algorithms for a Mixture Estimation Problem
We investigate the problem of estimating the proportion vector which maximizes the likelihood of a given sample for a mixture of given densities. We adapt a framework developed for...
David P. Helmbold, Yoram Singer, Robert E. Schapir...
78
Voted
ATAL
2008
Springer
14 years 11 months ago
Social reward shaping in the prisoner's dilemma
Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to nearoptimal behavior. In this paper, we introduce social reward sha...
Monica Babes, Enrique Munoz de Cote, Michael L. Li...
72
Voted
ECIS
2004
14 years 11 months ago
Open University vs. Consorzio Nettuno: an institutional analysis of two techonology enabled higher educational systems
Assuming a rational perspective, the adoption and development of a new organisational technology can be viewed as a way to achieve an higher level of efficiency by finding the bes...
Flavia Blumetti, Paolo Ferri, Cristiano Ghiringhel...
NIPS
2003
14 years 11 months ago
Extending Q-Learning to General Adaptive Multi-Agent Systems
Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...
Gerald Tesauro
70
Voted
GECCO
2008
Springer
172views Optimization» more  GECCO 2008»
14 years 10 months ago
Recursive least squares and quadratic prediction in continuous multistep problems
XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...
Daniele Loiacono, Pier Luca Lanzi