Search Sciweavers | Sciweavers

1235 search results - page 141 / 247

» ABC Reinforcement Learning

102

click to vote

CIKM
2000
Springer

104views Information Technology» more CIKM 2000»

Relevance and Reinforcement in Interactive Browsing

15 years 7 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

claim paper

Read More »

115

click to vote

ICML
2003
IEEE

151views Machine Learning» more ICML 2003»

Hierarchical Policy Gradient Algorithms

16 years 4 months ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

136

Voted

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 9 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

115

click to vote

GECCO
2005
Springer

111views Optimization» more GECCO 2005»

XCS with eligibility traces

15 years 8 months ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classiﬁer System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

claim paper

Read More »

120

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 4 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

« Prev « First page 141 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers