Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

9

ML
1998
ACM

favoriteEmaildiscussreport

117views Machine Learning» more ML 1998»

Learning Team Strategies: Soccer Case Studies

13 years 4 months ago

Learning Team Strategies: Soccer Case Studies

Download igitur-archive.library.uu.nl

We use simulated soccer to study multiagent learning. Each team's players (agents) share action set and policy, but may behave di erently due to position-dependent inputs. All agents making up a team are rewarded or punished collectively in case of goals. We conduct simulations with varying team sizes, and compare several learning algorithms: TD-Q learning with linear neural networks (TD-Q), Probabilistic Incremental Program Evolution (PIPE), and a PIPE version that learns by coevolution (CO-PIPE). TD-Q is based on learning evaluation functions (EFs) mapping input/action pairs to expected reward. PIPE and CO-PIPE search policy space directly. They use adaptive probability distributions to synthesize programs that calculate action probabilities from current inputs. Our results show that linear TD-Q encounters several di culties in learning appropriate shared EFs. PIPE and CO-PIPE, however, do not depend on EFs and nd good policies faster and more reliably. This suggests that in som...

Rafal Salustowicz, Marco Wiering, Jürgen Schm

Real-time Traffic

Linear Neural Networks | Machine Learning | ML 1998 | Policy Space | Probabilistic Incremental Program |

claim paper

Related Content

» On Learning Soccer Strategies

» Towards a LeagueIndependent Qualitative Soccer Theory for RoboCup

» Artificial Intelligence and Systems Theory Applied to Cooperative Robots

» Towards collaborative and adversarial learning a case study in robotic soccer

» Reinforcement Learning Soccer Teams with Incomplete World Models

» The Evolution of a Robot Soccer Team

» Learning to Select Negotiation Strategies in Multiagent Meeting Scheduling

» Reward allotment in an eventdriven hybrid learning classifier system for online soccer gam...

» Using a TwoLayered CaseBased Reasoning for Prediction in Soccer Coach

Post Info
More Details (n/a)

Added	22 Dec 2010
Updated	22 Dec 2010
Type	Journal
Year	1998
Where	ML
Authors	Rafal Salustowicz, Marco Wiering, Jürgen Schmidhuber

Comments (0)