Sciweavers

168 search results - page 25 / 34
» Optimism in Reinforcement Learning Based on Kullback-Leibler...
JMLR
2010
14 years 4 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ENTER
2009
Springer
15 years 4 months ago
Learning Adaptive Recommendation Strategies for Online Travel Planning
Conversational recommender systems support human-computer interaction strategies in order to assist online tourists in the important activity of dynamic packaging, i.e., in buildi...
Tariq Mahmood, Francesco Ricci, Adriano Venturini
GECCO
2006
Springer
15 years 1 month ago
Classifier prediction based on tile coding
This paper introduces XCSF extended with tile coding prediction: each classifier implements a tile coding approximator; the genetic algorithm is used to adapt both classifier cond...
Pier Luca Lanzi, Daniele Loiacono, Stewart W. Wils...
ICASSP
2010
IEEE
14 years 9 months ago
Hierarchical Gaussian Mixture Model
Gaussian mixture models (GMMs) are a convenient and essential tool for the estimation of probability density functions. Although GMMs are used in many research domains from image ...
Vincent Garcia, Frank Nielsen, Richard Nock
IROS
2007
IEEE
15 years 3 months ago
Motor control optimization of compliant one-legged locomotion in rough terrain
While underactuated robotic systems are capable of energy efficient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...
Fumiya Iida, Russ Tedrake