Sciweavers

1455 search results - page 47 / 291
» Exploiting Myopic Learning
Sort
View
ATAL
2006
Springer
15 years 3 months ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
PPSN
2004
Springer
15 years 5 months ago
Ensemble Learning with Evolutionary Computation: Application to Feature Ranking
Abstract. Exploiting the diversity of hypotheses produced by evolutionary learning, a new ensemble approach for Feature Selection is presented, aggregating the feature rankings ext...
Kees Jong, Elena Marchiori, Michèle Sebag
ACMICEC
2008
ACM
272views ECommerce» more  ACMICEC 2008»
15 years 1 months ago
Adapting the interaction state model in conversational recommender systems
Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...
Tariq Mahmood, Francesco Ricci
ICML
2006
IEEE
16 years 19 days ago
An intrinsic reward mechanism for efficient exploration
How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...
Özgür Simsek, Andrew G. Barto
GECCO
2008
Springer
144views Optimization» more  GECCO 2008»
15 years 27 days ago
Self-adaptive constructivism in Neural XCS and XCSF
For artificial entities to achieve high degrees of autonomy they will need to display appropriate adaptability. In this sense adaptability includes representational flexibility gu...
Gerard David Howard, Larry Bull, Pier Luca Lanzi