Search Sciweavers | Sciweavers

2905 search results - page 267 / 581

» Learning in Hyperlinked Environments

ML
1998
ACM

136 views, Machine Learning

Co-Evolution in the Successful Learning of Backgammon Strategy

15 years 3 months ago

Download www.demo.cs.brandeis.edu

Following Tesauro’s work on TD-Gammon, we used a 4000-parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of t...

Jordan B. Pollack, Alan D. Blair

ML
1998
ACM

153 views, Machine Learning

Bayesian Landmark Learning for Mobile Robot Localization

15 years 3 months ago

Download www.cs.cmu.edu

To operate successfully in indoor environments, mobile robots must be able to localize themselves. Most current localization algorithms lack flexibility, autonomy, and often optim...

Sebastian Thrun

ICRA
2010
IEEE

128 views, Robotics

A game-theoretic procedure for learning hierarchically structured strategies

15 years 2 months ago

Download homepages.inf.ed.ac.uk

This paper addresses the problem of acquiring a hierarchically structured robotic skill in a nonstationary environment. This is achieved through a combination of learning primi...

Benjamin Rosman, Subramanian Ramamoorthy

IAT
2010
IEEE

224 views, Intelligent Agents

Concept Learning Games: The Game of Query and Response

15 years 25 days ago

Download www.znu.ac.ir

This article deals with the issue of concept learning and tries to have a game theoretic view over the process of cooperative concept learning among agents in a multi-age...

Nima Mirbakhsh, Arman Didandeh, Mohsen Afsharchi

ICML
2002
IEEE

113 views, Machine Learning

Learning from Scarce Experience

16 years 4 months ago

Download www.cs.ucr.edu

Searching the space of policies directly for the optimal policy has been one popular method for solving partially observable reinforcement learning problems. Typically, with each ...

Leonid Peshkin, Christian R. Shelton
