Sciweavers

1310 search results - page 100 / 262
» Progressive Optimization in Action
Sort
View
AIPS
2007
15 years 6 months ago
Cost-Sharing Approximations for h+
Relaxations based on (either complete or partial) ignoring delete effects of the actions provide the basis for some seminal classical planning heuristics. However, the palette of ...
Vitaly Mirkis, Carmel Domshlak
WSC
1998
15 years 5 months ago
Modeling Cardiac Ion Channel Conductivity: Model Fitting via Simulation
We describe a Markov state model for a cloned potassium channel of the human heart ( 1KvLQTI ). The parameters of the model are determined by a least-squares fit of predicted vs. ...
John L. Maryak, Richard H. Smith, Raimond L. Winsl...
FCCM
2006
IEEE
106views VLSI» more  FCCM 2006»
15 years 10 months ago
Scalable Hardware Architecture for Real-Time Dynamic Programming Applications
Abstract— This paper introduces a novel architecture for performing the core computations required by dynamic programming (DP) techniques. The latter pertain to a vast range of a...
Brad Matthews, Itamar Elhanany
AAAI
2006
15 years 6 months ago
Learning Basis Functions in Hybrid Domains
Markov decision processes (MDPs) with discrete and continuous state and action components can be solved efficiently by hybrid approximate linear programming (HALP). The main idea ...
Branislav Kveton, Milos Hauskrecht
127
Voted
IJCAI
2003
15 years 5 months ago
Multiple-Goal Reinforcement Learning with Modular Sarsa(0)
We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...
Nathan Sprague, Dana H. Ballard