Search Sciweavers | Sciweavers

96 search results - page 2 / 20

» Adding Reinforcement Learning Features to the Neural-Gas Met...

ESANN
2006

192 views, Neural Networks

Margin based Active Learning for LVQ Networks

13 years 6 months ago

Download www2.in.tu-clausthal.de

In this article, we extend a local prototype-based learning model by active learning, which gives the learner the capability to select training samples during the model adaptation...

Frank-Michael Schleif, Barbara Hammer, Thomas Vill...

CG
2006
Springer

155 views, Computer Graphics

Feature Construction for Reinforcement Learning in Hearts

13 years 7 months ago

Download webdocs.cs.ualberta.ca

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

ICML
2003
IEEE

157 views, Machine Learning

Action Elimination and Stopping Conditions for Reinforcement Learning

14 years 6 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning upper and lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

ACMICEC
2008
ACM

272 views, ECommerce

Adapting the interaction state model in conversational recommender systems

13 years 7 months ago

Download www.inf.unibz.it

Conventional conversational recommender systems support interaction strategies that are hard-coded into the system in advance. In this context, Reinforcement Learning techniques h...

Tariq Mahmood, Francesco Ricci

IEEEPACT
2008
IEEE

136 views, Distributed And Parallel Com...

Feature selection and policy optimization for distributed instruction placement using reinforcement learning

13 years 11 months ago

Download userweb.cs.utexas.edu

Communication overheads are one of the fundamental challenges in a multiprocessor system. As the number of processors on a chip increases, communication overheads and the distribu...

Katherine E. Coons, Behnam Robatmili, Matthew E. T...
