Search Sciweavers | Sciweavers

1799 search results - page 190 / 360

» Filtered Reinforcement Learning

135

click to vote

ML
1998
ACM

136views Machine Learning» more ML 1998»

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 4 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

claim paper

Read More »

156

click to vote

MAGS
2010

81views more MAGS 2010»

Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation

14 years 11 months ago

Download damas.ift.ulaval.ca

Coordinating agents in a complex environment is a hard problem, but it can become even harder when certain characteristics of the tasks, like the required number of agents, are un...

Sébastien Paquet, Brahim Chaib-draa, Patric...

claim paper

Read More »

126

click to vote

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Restricted Boltzmann machines for collaborative filtering

16 years 5 months ago

Download www.machinelearning.org

Most of the existing approaches to collaborative filtering cannot handle very large data sets. In this paper we show how a class of two-layer undirected graphical models, called R...

Ruslan Salakhutdinov, Andriy Mnih, Geoffrey E. Hin...

claim paper

Read More »

125

click to vote

ICML
2002
IEEE

113views Machine Learning» more ICML 2002»

Learning from Scarce Experience

16 years 5 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton

claim paper

Read More »

137

click to vote

FBIT
2007
IEEE

142views Information Technology» more FBIT 2007»

Learning to Drive a Real Car in 20 Minutes

15 years 10 months ago

Download www.ni.uos.de

The paper describes our ﬁrst experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...

Martin Riedmiller, Michael Montemerlo, Hendrik Dah...

claim paper

Read More »

« Prev « First page 190 / 360 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers