Search Sciweavers | Sciweavers

2108 search results - page 281 / 422

» Tracking in Reinforcement Learning

EWCBR
2008
Springer

224 views, Automated Reasoning

Discovering Feature Weights for Feature-based Indexing of Q-tables

15 years 5 months ago

Download www.cse.lehigh.edu

In this paper we propose an approach to address the old problem of identifying the feature conditions under which a gaming strategy can be effective. For doing this, we will build ...

Chad Hogg, Stephen Lee-Urban, Bryan Auslander, H...

AIPS
2006

129 views, Artificial Intelligence

Reusing and Building a Policy Library

15 years 5 months ago

Download www.cs.cmu.edu

Policy Reuse is a method to improve reinforcement learning with the ability to solve multiple tasks by building upon past problem solving experience, as accumulated in a Policy Li...

Fernando Fernández, Manuela M. Veloso

NN
2006
Springer

140 views, Neural Networks

Neural mechanism for stochastic behaviour during a competitive game

15 years 3 months ago

Download wanglab.med.yale.edu

Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...

Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang

TSMC
2008

135 views

Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator

15 years 3 months ago

Download www.icasa.nmt.edu

This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...

Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi

AI
2002
Springer

117 views, Artificial Intelligence

Programming backgammon using self-teaching neural nets

15 years 3 months ago

Download www.math-info.univ-paris5.fr

TD-Gammon is a neural network that is able to teach itself to play backgammon solely by playing against itself and learning from the results. Starting from random initial play, TD...

Gerald Tesauro
