Sciweavers

3412 search results - page 197 / 683
» Efficient Reinforcement Learning
Sort
View
ACG
2003
Springer
15 years 9 months ago
Evaluation in Go by a Neural Network using Soft Segmentation
In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position ev...
Markus Enzenberger
255
Voted
ROBOCUP
2001
Springer
125views Robotics» more  ROBOCUP 2001»
15 years 8 months ago
Essex Wizards 2001 Team Description
This article presents an overview of the Essex Wizards 2001 team participated in the RoboCup 2001 simulator league. Four major issues have been addressed, namely a generalized appr...
Huosheng Hu, Kostas Kostiadis, Matthew Hunter, Nik...
NIPS
2004
15 years 5 months ago
Multi-agent Cooperation in Diverse Population Games
We consider multi-agent systems whose agents compete for resources by striving to be in the minority group. The agents adapt to the environment by reinforcement learning of the pr...
K. Y. Michael Wong, S. W. Lim, Zhuo Gao
125
Voted
NIPS
2003
15 years 5 months ago
Policy Search by Dynamic Programming
We consider the policy search approach to reinforcement learning. We show that if a “baseline distribution” is given (indicating roughly how often we expect a good policy to v...
J. Andrew Bagnell, Sham Kakade, Andrew Y. Ng, Jeff...
ANOR
2005
80views more  ANOR 2005»
15 years 3 months ago
Entropic Penalties in Finite Games
The main objects here are finite-strategy games in which entropic terms are subtracted from the payoffs. After such subtraction each Nash equilibrium solves an explicit, unconstra...
Sjur Didrik Flåm, E. Cavazzuti