Search Sciweavers | Sciweavers

13 search results - page 3 / 3

» Model-free reinforcement learning as mixture learning

click to vote

LWA
2007

160views Software Engineering» more LWA 2007»

Towards Learning User-Adaptive State Models in a Conversational Recommender System

13 years 6 months ago

Download users.informatik.uni-halle.de

Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...

Tariq Mahmood, Francesco Ricci

claim paper

Read More »

click to vote

ICCS
2007
Springer

124views Applied Computing» more ICCS 2007»

Towards Real-Time Distributed Signal Modeling for Brain-Machine Interfaces

13 years 11 months ago

Download nrg.mbi.ufl.edu

New architectures for Brain-Machine Interface communication and control use mixture models for expanding rehabilitation capabilities of disabled patients. Here we present and test ...

Jack DiGiovanna, Loris Marchal, Prapaporn Rattanat...

claim paper

Read More »

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

13 years 9 days ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

« Prev « First page 3 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers