Search Sciweavers | Sciweavers

166

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

16 years 6 months ago

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

177

click to vote

CORR
2007
Springer

106views Education» more CORR 2007»

Bandit Algorithms for Tree Search

15 years 5 months ago

Download hal.inria.fr

Bandit based methods for tree search have recently gained popularity when applied to huge trees, e.g. in the game of go [6]. Their eﬃcient exploration of the tree enables to ret...

Pierre-Arnaud Coquelin, Rémi Munos

claim paper

Read More »

142

click to vote

ATAL
2003
Springer

105views Intelligent Agents» more ATAL 2003»

Locating moving entities in indoor environments with teams of mobile robots

15 years 11 months ago

Download www.cs.cmu.edu

This article presents an implemented multi-robot system for playing the popular game of laser tag. The object of the game is to search for and tag opponents that can move freely a...

Matthew Rosencrantz, Geoffrey J. Gordon, Sebastian...

claim paper

Read More »

169

click to vote

PODC
2010
ACM

144views Distributed and Parallel Com...» more PODC 2010»

Finding mobile data under delay constraints with searching costs

15 years 9 months ago

Download mobiledata.cs.gc.cuny.edu

A token is hidden in one of several boxes and then the boxes are locked. The probability of placing the token in each of the boxes is known. A searcher is looking for the token by...

Amotz Bar-Noy, Panagiotis Cheilaris, Yi Feng 0002,...

claim paper

Read More »

187

click to vote

BIBE
2008
IEEE

142views Bioinformatics» more BIBE 2008»

Optimizing performance, cost, and sensitivity in pairwise sequence search on a cluster of PlayStations

16 years 6 days ago

Download people.cs.vt.edu

— The Smith-Waterman algorithm is a dynamic programming method for determining optimal local alignments between nucleotide or protein sequences. However, it suffers from quadrati...

Ashwin M. Aji, Wu-chun Feng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers