Search Sciweavers | Sciweavers

575 search results - page 58 / 115

» Reinforcement Learning State Estimator

click to vote

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

15 years 6 months ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

click to vote

BIOADIT
2004
Springer

90views Information Technology» more BIOADIT 2004»

Autonomous Acquisition of the Meaning of Sensory States Through Sensory-Invariance Driven Action

15 years 5 months ago

Download faculty.cs.tamu.edu

Abstract. How can artiﬁcial or natural agents autonomously gain understanding of its own internal (sensory) state? This is an important question not just for physically embodied ...

Yoonsuck Choe, S. Kumar Bhamidipati

claim paper

Read More »

110

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

14 years 10 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

Voted

IJCAI
2003

118views Artificial Intelligence» more IJCAI 2003»

Simultaneous Adversarial Multi-Robot Learning

15 years 1 months ago

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

127

Voted

CSL
2010
Springer

238views Automated Reasoning» more CSL 2010»

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

15 years 17 days ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

claim paper

Read More »

« Prev « First page 58 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers