Sciweavers

453 search results - page 57 / 91
» Learning from actions not taken: a multiagent learning algor...
Sort
View
ATAL
2006
Springer
15 years 1 months ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
NIPS
2007
14 years 11 months ago
Learning Visual Attributes
We present a probabilistic generative model of visual attributes, together with an efficient learning algorithm. Attributes are visual qualities of objects, such as ‘red’, ...
Vittorio Ferrari, Andrew Zisserman
ICALT
2010
IEEE
14 years 8 months ago
Course Ranking and Automated Suggestions through Web Mining
—This paper introduces new metrics for course evaluation. It is also proposes a ranking algorithm that classifies courses based on the previous course evaluation metrics and sugg...
Stavros Valsamidis, Ioannis Kazanidis, Sotirios Ko...
AAI
2010
195views more  AAI 2010»
14 years 6 months ago
Automatic Extraction of Go Game Positions from Images: a Multi-Strategical Approach to Constrained Multi-Object Recognition
Here, we present a constrained object recognition task that has been robustly solved largely with simple machine learning methods, using a small corpus of about 100 images taken u...
Alexander K. Seewald
COLT
2010
Springer
14 years 7 months ago
Open Loop Optimistic Planning
We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...
Sébastien Bubeck, Rémi Munos