Search Sciweavers | Sciweavers

267 search results - page 37 / 54

» The Dynamics of Multi-Agent Reinforcement Learning

AR
2008

118 views

Efficient Behavior Learning Based on State Value Estimation of Self and Others

15 years 5 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning methods have been seriously suffering from the curse of dimension problem especially when they are applied to multiagent dynamic environments. ...

Yasutake Takahashi, Kentarou Noma, Minoru Asada

NIPS
1996

134 views, Information Technology

Why did TD-Gammon Work?

15 years 7 months ago

Download www.cse.unsw.edu.au

Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...

Jordan B. Pollack, Alan D. Blair

IROS
2008
IEEE

165 views, Robotics

Mutual development of behavior acquisition and recognition based on value system

16 years 2 days ago

Download www.er.ams.eng.osaka-u.ac.jp

Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...

Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada

CIG
2005
IEEE

162 views, Applied Computing

Nannon: A Nano Backgammon for Machine Learning Research

15 years 11 months ago

Download cswww.essex.ac.uk

A newly designed game is introduced, which feels like Backgammon, but has a simplified rule set. Unlike earlier attempts at simplifying the game, Nannon maintains enough features a...

Jordan B. Pollack

SASO
2008
IEEE

125 views, Control Systems

Self-Adaptive Dissemination of Data in Dynamic Sensor Networks

16 years 1 day ago

Download www.datafusionlab.org

The distribution of data in large dynamic wireless sensor networks presents a difficult problem due to node mobility, link failures, and traffic congestion. In this paper, we pr...

David Dorsey, Bjorn Jay Carandang, Moshe Kam, Chri...
