Sciweavers

160 search results - page 24 / 32
» Optimization on a Budget: A Reinforcement Learning Approach
Sort
View
DEXA
2004
Springer
172views Database» more  DEXA 2004»
15 years 5 months ago
On the Automation of Similarity Information Maintenance in Flexible Query Answering Systems
This paper proposes a method for automatic maintaining the similarity information for a particular class of Flexible Query Answering Systems (FQAS). The paper describes the three m...
Balázs Csanád Csáji, Josef K&...
ICML
2010
IEEE
14 years 9 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
JMLR
2010
119views more  JMLR 2010»
14 years 6 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ACMICEC
2007
ACM
154views ECommerce» more  ACMICEC 2007»
15 years 3 months ago
Learning and adaptivity in interactive recommender systems
Recommender systems are intelligent E-commerce applications that assist users in a decision-making process by offering personalized product recommendations during an interaction s...
Tariq Mahmood, Francesco Ricci
107
Voted
ENTER
2009
Springer
15 years 6 months ago
Learning Adaptive Recommendation Strategies for Online Travel Planning
Conversational recommender systems support human-computer interaction strategies in order to assist online tourists in the important activity of dynamic packaging, i.e., in buildi...
Tariq Mahmood, Francesco Ricci, Adriano Venturini