Search Sciweavers | Sciweavers

IROS
2009
IEEE

206 views, Robotics

Bayesian reinforcement learning in continuous POMDPs with Gaussian processes

16 years 2 months ago

Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

AAMAS
2005
Springer

133 views, Intelligent Agents

Advice-Exchange Between Evolutionary Algorithms and Reinforcement Learning Agents: Experiments in the Pursuit Domain

16 years 1 month ago

Download iscte.pt

This research aims at studying the effects of exchanging information during the learning process in Multiagent Systems. The concept of advice-exchange, introduced in (Nunes and Ol...

Luís Nunes, Eugénio C. Oliveira

ATAL
2007
Springer

147 views, Intelligent Agents

A reinforcement learning based distributed search algorithm for hierarchical peer-to-peer information retrieval systems

15 years 11 months ago

Download www.haizhengzhang.com

The dominant existing routing strategies employed in peer-to-peer (P2P) based information retrieval (IR) systems are similarity-based approaches. In these approaches, agents depend o...

Haizheng Zhang, Victor R. Lesser

GECCO
2005
Springer

111 views, Optimization

XCS with eligibility traces

16 years 1 month ago

Download www.bcs.rochester.edu

The development of the XCS Learning Classifier System has produced a robust and stable implementation that performs competitively in direct-reward environments. Although investig...

Jan Drugowitsch, Alwyn Barry

ESANN
2003

152 views, Neural Networks

Improving iterative repair strategies for scheduling with the SVM

15 years 8 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer
