Search Sciweavers | Sciweavers

64 search results - page 12 / 13

» Reducing the complexity of multiagent reinforcement learning

ICRA
2003
IEEE

Robotics

Multi-robot task-allocation through vacancy chains

13 years 10 months ago

Download www-robotics.usc.edu

Existing task allocation algorithms generally do not consider the effects of task interaction, such as interference, but instead assume that tasks are independent. That assumptio...

Torbjørn S. Dahl, Maja J. Mataric, Gaurav S...

ECAI
2006
Springer

Artificial Intelligence

Least Squares SVM for Least Squares TD Learning

13 years 9 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani

IJRR
2011

Learning visual representations for perception-action systems

13 years 8 days ago

Download robot-learning.de

We discuss vision as a sensory modality for systems that effect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...

Justus H. Piater, Sébastien Jodogne, Renaud...

ATAL
2006
Springer

Intelligent Agents

Learning to identify winning coalitions in the PAC model

13 years 9 months ago

Download www.cs.huji.ac.il

We consider PAC learning of simple cooperative games, in which the coalitions are partitioned into "winning" and "losing" coalitions. We analyze the complexity...

Ariel D. Procaccia, Jeffrey S. Rosenschein

Publication

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 2 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros
