Sciweavers

» Local Search with Bootstrapping
ICML
2006
IEEE
Using inaccurate models in reinforcement learning
In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...
Pieter Abbeel, Morgan Quigley, Andrew Y. Ng
ICML
2006
IEEE
Bayesian pattern ranking for move prediction in the game of Go
We investigate the problem of learning to predict moves in the board game of Go from game records of expert players. In particular, we obtain a probability distribution over legal...
David H. Stern, Ralf Herbrich, Thore Graepel
ICML
1995
IEEE
Ant-Q: A Reinforcement Learning Approach to the Traveling Salesman Problem
In this paper we introduce Ant-Q, a family of algorithms which present many similarities with Q-learning (Watkins, 1989), and which we apply to the solution of symmetric and asymm...
Luca Maria Gambardella, Marco Dorigo
ICSE
2009
IEEE-ACM
Locating need-to-translate constant strings for software internationalization
Modern software applications require internationalization to be distributed to different regions of the world. In various situations, many software applications are not internatio...
Xiaoyin Wang, Lu Zhang, Tao Xie, Hong Mei, Jiasu S...
ISBI
2006
IEEE
Molecular image registration using mutual information and differential evolution optimization
In this work we propose a novel rigid image registration approach to determine the position of high-resolution molecular structures in medium-resolution macromolecular complexes. ...
Bartosz Telenczuk, María J. Ledesma-Carbayo...