Sciweavers

892 search results - page 133 / 179
» Action respecting embedding
Sort
View
118
Voted
ICML
2010
IEEE
15 years 1 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
99
Voted
COMBINATORICS
2006
126views more  COMBINATORICS 2006»
15 years 19 days ago
Constructions of Representations of Rank Two Semisimple Lie Algebras with Distributive Lattices
We associate one or two posets (which we call "semistandard posets") to any given irreducible representation of a rank two semisimple Lie algebra over C. Elsewhere we ha...
L. Wyatt Alverson II, Robert G. Donnelly, Scott J....
IJCSS
2006
116views more  IJCSS 2006»
15 years 18 days ago
Extracting Motor Unit Firing Information by Independent Component Analysis of Surface Electromyogram: A Preliminary Study Using
Decomposition of electromyogram (EMG) provides a valuable means of obtaining motor unit recruitment and firing rate information. The feasibility of decomposing surface EMG signals...
Ping Zhou, M. M. Lowery, W. Zev Rymer
125
Voted
JFR
2007
150views more  JFR 2007»
15 years 15 days ago
Decisional autonomy of planetary rovers
To achieve the ever increasing demand for science return, planetary exploration rovers require more autonomy to successfully perform their missions. Indeed, the communication dela...
Félix Ingrand, Simon Lacroix, Solange Lemai...
157
Voted
AI
1998
Springer
15 years 9 days ago
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok