Search Sciweavers | Sciweavers

62 search results - page 7 / 13

» Learning and Exploiting Relative Weaknesses of Opponent Agen...

click to vote

ATAL
2005
Springer

126views Intelligent Agents» more ATAL 2005»

Theory of moves learners: towards non-myopic equilibria

15 years 5 months ago

Download euler.mcs.utulsa.edu

In contrast to classical game theoretic analysis of simultaneous and sequential play in bimatrix games, Steven Brams has proposed an alternative framework called the Theory of Mov...

Arjita Ghosh, Sandip Sen

claim paper

Read More »

136

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

14 years 9 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

ATAL
2007
Springer

81views Intelligent Agents» more ATAL 2007»

Multiagent learning in adaptive dynamic systems

15 years 5 months ago

Download www.damas.ift.ulaval.ca

Classically, an approach to the multiagent policy learning supposed that the agents, via interactions and/or by using preliminary knowledge about the reward functions of all playe...

Andriy Burkov, Brahim Chaib-draa

claim paper

Read More »

click to vote

AAAI
2007

142views Intelligent Agents» more AAAI 2007»

Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison

15 years 2 months ago

Download staff.science.uva.nl

Reinforcement learning (RL) methods have become popular in recent years because of their ability to solve complex tasks with minimal feedback. Both genetic algorithms (GAs) and te...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

109

click to vote

CEC
2010
IEEE

161views Artificial Intelligence» more CEC 2010»

Coordinate System Archive for coevolution

15 years 25 days ago

Download www.cs.put.poznan.pl

Problems in which some entities interact with each other are common in computational intelligence. This scenario, typical for co-evolving artificial-life agents, learning strategie...

Wojciech Jaskowski, Krzysztof Krawiec

claim paper

Read More »

« Prev « First page 7 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers