Sciweavers

71 search results - page 3 / 15
» Relative Entropy Policy Search
Sort
View
AAAI
2007
13 years 8 months ago
Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison
Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
EVOW
2009
Springer
14 years 1 months ago
Evolutionary Optimization Guided by Entropy-Based Discretization
The Learnable Evolution Model (LEM) involves alternating periods of optimization and learning, performa extremely well on a range of problems, a specialises in achieveing good resu...
Guleng Sheri, David W. Corne
ATAL
2007
Springer
14 years 14 days ago
Transfer via inter-task mappings in policy search reinforcement learning
The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
MM
2005
ACM
243views Multimedia» more  MM 2005»
13 years 12 months ago
Image region entropy: a measure of "visualness" of web images associated with one concept
We propose a new method to measure “visualness” of concepts, that is, what extent concepts have visual characteristics. To know which concept has visually discriminative power...
Keiji Yanai, Kobus Barnard
SIGIR
2005
ACM
13 years 12 months ago
Relevance information: a loss of entropy but a gain for IDF?
When investigating alternative estimates for term discriminativeness, we discovered that relevance information and idf are much closer related than formulated in classical literat...
Arjen P. de Vries, Thomas Rölleke