Sciweavers

874 search results - page 68 / 175
» Iterative Learning Control - Monotonicity and Optimization
Sort
View
NIPS
1996
14 years 11 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
NIPS
2001
14 years 11 months ago
The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay
Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...
Michael Kositsky, Andrew G. Barto
ICML
1995
IEEE
15 years 10 months ago
Residual Algorithms: Reinforcement Learning with Function Approximation
A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...
Leemon C. Baird III
ANOR
2005
81views more  ANOR 2005»
14 years 9 months ago
Managing Stochastic, Finite Capacity, Multi-Project Systems through the Cross-Entropy Methodology
This paper addresses the problem of loading a finite capacity, stochastic (random) and dynamic multi-project system. The system is controlled by keeping a constant number of projec...
Izack Cohen, Boaz Golany, Avraham Shtub
MP
2006
103views more  MP 2006»
14 years 10 months ago
Assessing solution quality in stochastic programs
Determining if a solution is optimal or near optimal is fundamental in optimization theory, algorithms, and computation. For instance, Karush-Kuhn-Tucker conditions provide necessa...
Güzin Bayraksan, David P. Morton