Sciweavers

» Metric learning for reinforcement learning agents
JAIR
2011
A Monte-Carlo AIXI Approximation
This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...
Joel Veness, Kee Siong Ng, Marcus Hutter, William ...
ATAL
2006
Springer
Learning to commit in repeated games
Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is an open problem in multiagent learning. Our goal is to facilitate the learning ...
Stéphane Airiau, Sandip Sen
ATAL
2008
Springer
On the usefulness of opponent modeling: the Kuhn Poker case study
The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...
Alessandro Lazaric, Mario Quaresimale, Marcello Re...
FLAIRS
2004
A New Filtering Model towards an Intelligent Guide Agent
In E-learning systems, where both helpers (tutors) and learners are separated geographically, finding a reliable helper is one of the most important challenges. Although helpers c...
Mohammed Abdel Razek, Claude Frasson, Marc Kaltenb...
ATAL
2003
Springer
Team formation and communication restrictions in collectives
A collective of agents often needs to maximize a “world utility” function which rates the performance of an entire system, while subject to communication restrictions among th...
Adrian K. Agogino, Kagan Tumer