Sciweavers

1351 search results - page 137 / 271
» Learning from compressed observations
ATAL
2010
Springer
15 years 5 months ago
Frequency adjusted multi-agent Q-learning
Multi-agent learning is a crucial method to control or find solutions for systems in which more than one entity needs to be adaptive. In today's interconnected world, such s...
Michael Kaisers, Karl Tuyls
CIARP
2006
Springer
15 years 6 months ago
Robustness Analysis of the Neural Gas Learning Algorithm
The Neural Gas (NG) is a Vector Quantization technique in which a set of prototypes self-organize to represent the topological structure of the data. The learning algorithm of the Neural...
Carolina Saavedra, Sebastián Moreno, Rodrig...
IADIS
2004
15 years 5 months ago
Modelling Inductive Reasoning Ability for Adaptive Virtual Learning Environment
Inductive reasoning is one of the important characteristics of human intelligence. Researchers have regarded inductive reasoning as one of the seven primary mental abilities that ...
Taiyu Lin, Kinshuk, Paul McNab
IJCAI
2001
15 years 5 months ago
R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning
R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...
Ronen I. Brafman, Moshe Tennenholtz
UAI
2003
15 years 5 months ago
Learning Continuous Time Bayesian Networks
Continuous time Bayesian networks (CTBN) describe structured stochastic processes with finitely many states that evolve over continuous time. A CTBN is a directed (possibly cycli...
Uri Nodelman, Christian R. Shelton, Daphne Koller