Sciweavers

13 search results - page 3 / 3
» Model-free reinforcement learning as mixture learning
LWA
2007
13 years 6 months ago
Towards Learning User-Adaptive State Models in a Conversational Recommender System
Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...
Tariq Mahmood, Francesco Ricci
ICCS
2007
Springer
13 years 11 months ago
Towards Real-Time Distributed Signal Modeling for Brain-Machine Interfaces
New architectures for Brain-Machine Interface communication and control use mixture models for expanding rehabilitation capabilities of disabled patients. Here we present and test ...
Jack DiGiovanna, Loris Marchal, Prapaporn Rattanat...
JAIR
2011
13 years 9 days ago
A Monte-Carlo AIXI Approximation
This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...
Joel Veness, Kee Siong Ng, Marcus Hutter, William ...