Search Sciweavers | Sciweavers

99 search results - page 3 / 20

» A Reinforcement Learning Approach for Multiagent Navigation

171

click to vote

IJAIT
2008

60views more IJAIT 2008»

A Hybrid Multiagent Reinforcement Learning Approach Using Strategies and Fusion

15 years 7 months ago

Download lpis.csd.auth.gr

Ioannis Partalas, Ioannis Feneris, Ioannis P. Vlah...

claim paper

Read More »

217

Voted

ECML
2003
Springer

149views Machine Learning» more ECML 2003»

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

16 years 20 days ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can ﬁnd itself unable to distinguish between diﬀering state...

Paul A. Crook, Gillian Hayes

claim paper

Read More »

220

click to vote

ICMLA
2010

207views Machine Learning» more ICMLA 2010»

Multi-Agent Inverse Reinforcement Learning

15 years 5 months ago

Download ftp.cs.wisc.edu

Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship l...

Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah,...

claim paper

Read More »

222

click to vote

ROBOCUP
2005
Springer

134views Robotics» more ROBOCUP 2005»

Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System

16 years 28 days ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning approaches have been suﬀering from the policy alternation of others in multiagent dynamic environments. A typical example is a case of RoboCup...

Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...

claim paper

Read More »

192

click to vote

ICML
2002
IEEE

133views Machine Learning» more ICML 2002»

Coordinated Reinforcement Learning

16 years 8 months ago

Download select.cs.cmu.edu

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value fu...

Carlos Guestrin, Michail G. Lagoudakis, Ronald Par...

claim paper

Read More »

« Prev « First page 3 / 20 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers