Search Sciweavers | Sciweavers


LWA
2007

Software Engineering

Towards Learning User-Adaptive State Models in a Conversational Recommender System

Download users.informatik.uni-halle.de

Typical conversational recommender systems support interactive strategies that are hard-coded in advance and followed rigidly during a recommendation session. In fact, Reinforceme...

Tariq Mahmood, Francesco Ricci

