Search Sciweavers | Sciweavers

COLT
1995
Springer

Machine Learning

A Comparison of New and Old Algorithms for a Mixture Estimation Problem

15 years 3 months ago

We investigate the problem of estimating the proportion vector which maximizes the likelihood of a given sample for a mixture of given densities. We adapt a framework developed for...

David P. Helmbold, Yoram Singer, Robert E. Schapir...

ATAL
2008
Springer

Intelligent Agents

Social reward shaping in the prisoner's dilemma

15 years 1 months ago

Download www.aamas-conference.org

Reward shaping is a well-known technique applied to help reinforcement-learning agents converge more quickly to near-optimal behavior. In this paper, we introduce social reward sha...

Monica Babes, Enrique Munoz de Cote, Michael L. Li...

ECIS
2004

Information Technology

Open University vs. Consorzio Nettuno: an institutional analysis of two technology enabled higher educational systems

15 years 1 months ago

Download is2.lse.ac.uk

Assuming a rational perspective, the adoption and development of a new organisational technology can be viewed as a way to achieve a higher level of efficiency by finding the bes...

Flavia Blumetti, Paolo Ferri, Cristiano Ghiringhel...

NIPS
2003

Information Technology

Extending Q-Learning to General Adaptive Multi-Agent Systems

15 years 1 months ago

Download books.nips.cc

Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This pap...

Gerald Tesauro

GECCO
2008
Springer

Optimization

Recursive least squares and quadratic prediction in continuous multistep problems

15 years 27 days ago

Download www.cs.bham.ac.uk

XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...

Daniele Loiacono, Pier Luca Lanzi
