Sciweavers

2011 search results - page 167 / 403
» Universal Reinforcement Learning
Sort
View
TREC
2007
15 years 5 months ago
FUB, IASI-CNR and University of Tor Vergata at TREC 2007 Blog Track
We present a fully automatic and weighted dictionary to be used in topical opinion retrieval. We also define a simple topical opinion retrieval function that is free from paramete...
Gianni Amati, Edgardo Ambrosi, Marco Bianchi, Carl...
143
Voted
NECO
2008
146views more  NECO 2008»
15 years 4 months ago
Deep, Narrow Sigmoid Belief Networks Are Universal Approximators
In this paper we show that exponentially deep belief networks [3, 7, 4] can approximate any distribution over binary vectors to arbitrary accuracy, even when the width of each lay...
Ilya Sutskever, Geoffrey E. Hinton
EPIA
2007
Springer
15 years 10 months ago
Generalization and Transfer Learning in Noise-Affected Robot Navigation Tasks
Abstract. When a robot learns to solve a goal-directed navigation task with reinforcement learning, the acquired strategy can usually exclusively be applied to the task that has be...
Lutz Frommberger
161
Voted
IJCAI
2001
15 years 6 months ago
Rational and Convergent Learning in Stochastic Games
This paper investigates the problem of policy learning in multiagent environments using the stochastic game framework, which we briefly overview. We introduce two properties as de...
Michael H. Bowling, Manuela M. Veloso
155
Voted
ATAL
2004
Springer
15 years 10 months ago
A Pheromone-Based Utility Model for Collaborative Foraging
Multi-agent research often borrows from biology, where remarkable examples of collective intelligence may be found. One interesting example is ant colonies’ use of pheromones as...
Liviu Panait, Sean Luke