Search Sciweavers | Sciweavers

69

AAAI
1997

107 views, Intelligent Agents

14 years 11 months ago

This paper steps back from the standard infinite horizon formulation of reinforcement learning problems to consider the simpler case of finite horizon problems. Although finite ho...

Daishi Harada

81

NIPS
2003

119 views, Information Technology

All Learning is Local: Multi-agent Learning in Global Reward Games

14 years 11 months ago

Download www.its.caltech.edu

In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms. We provide a simple and ef...

Yu-Han Chang, Tracey Ho, Leslie Pack Kaelbling

62

ECML
2007
Springer

108 views, Machine Learning

Safe Q-Learning on Complete History Spaces

15 years 4 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable Markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

70

RA
2003

135 views, Robotics

Behavioural Cloning and Robot Control

14 years 11 months ago

Download www.cis.utas.edu.au

Behavioural cloning is a method by which a machine learns control skills through observing what a human controller would do in a certain set of circumstances. More specifically, t...

Claire D'Este, Mark O'Sullivan, Nicholas Hannah

86

NN
2006
Springer

79 views, Neural Networks

The misbehavior of value and the discipline of the will

14 years 10 months ago

Download www.cns.nyu.edu

Most reinforcement learning models of animal conditioning operate under the convenient, though fictive, assumption that Pavlovian conditioning concerns prediction learning whereas...

Peter Dayan, Yael Niv, Ben Seymour, Nathaniel D. D...
