Search Sciweavers | Sciweavers

1233 search results - page 141 / 247

» Reinforcement learning

215

click to vote

Publication

352views

Efficient methods for near-optimal sequential decision making under uncertainty

16 years 2 months ago

Download fias.uni-frankfurt.de

This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal se...

Christos Dimitrakakis

posted by olethros

Read More »

166

Voted

AAMAS
2005
Springer

126views Intelligent Agents» more AAMAS 2005»

Learning to Coordinate Using Commitment Sequences in Cooperative Multi-agent Systems

16 years 14 days ago

Download como.vub.ac.be

We report on an investigation of the learning of coordination in cooperative multi-agent systems. Speciﬁcally, we study solutions that are applicable to independent agents i.e. ...

Spiros Kapetanakis, Daniel Kudenko, Malcolm J. A. ...

claim paper

Read More »

207

Voted

AAAI
2007

122views Intelligent Agents» more AAAI 2007»

RETALIATE: Learning Winning Policies in First-Person Shooter Games

15 years 9 months ago

Download www.cse.lehigh.edu

In this paper we present RETALIATE, an online reinforcement learning algorithm for developing winning policies in team firstperson shooter games. RETALIATE has three crucial chara...

Megan Smith, Stephen Lee-Urban, Hector Muño...

claim paper

Read More »

224

click to vote

COST
2009
Springer

185views Multimedia» more COST 2009»

How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?

15 years 4 months ago

Download gaussier.free.fr

Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...

Ken Prepin, Philippe Gaussier

claim paper

Read More »

201

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 4 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

« Prev « First page 141 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers