Sciweavers

33 search results - page 2 / 7
» Pathologies of temporal difference methods in approximate dy...
Sort
View
CDC
2010
IEEE
139views Control Systems» more  CDC 2010»
12 years 12 months ago
Q-learning and enhanced policy iteration in discounted dynamic programming
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...
Dimitri P. Bertsekas, Huizhen Yu
NCA
2008
IEEE
13 years 4 months ago
Neurodynamic programming: a case study of the traveling salesman problem
The paper focuses on the study of solving the large-scale traveling salesman problem (TSP) based on neurodynamic programming. From this perspective, two methods, temporal differenc...
Jia Ma, Tao Yang, Zeng-Guang Hou, Min Tan, Derong ...
NIPS
1996
13 years 6 months ago
Why did TD-Gammon Work?
Although TD-Gammon is one of the major successes in machine learning, it has not led to similar impressive breakthroughs in temporal difference learning for other applications or ...
Jordan B. Pollack, Alan D. Blair
TITB
2008
111views more  TITB 2008»
13 years 4 months ago
A Spine X-Ray Image Retrieval System Using Partial Shape Matching
In recent years, there has been a rapid increase in the size and number of medical image collections. Thus, the development of appropriate methods for medical information retrieval...
Xiaoqian Xu, Dah-Jye Lee, Sameer Antani, L. Rodney...
BMCBI
2007
172views more  BMCBI 2007»
13 years 5 months ago
msBayes: Pipeline for testing comparative phylogeographic histories using hierarchical approximate Bayesian computation
Background: Although testing for simultaneous divergence (vicariance) across different population-pairs that span the same barrier to gene flow is of central importance to evoluti...
Michael J. Hickerson, Eli Stahl, Naoki Takebayashi