Sciweavers

2011 search results - page 78 / 403
» Universal Reinforcement Learning
Sort
View
EWRL
2008
15 years 5 months ago
Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case
We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...
Kirill Dyagilev, Shie Mannor, Nahum Shimkin
AR
2004
84views more  AR 2004»
15 years 3 months ago
Reinforcement learning of humanoid rhythmic walking parameters based on visual information
This paper presents a method for learning the parameters of rhythmic walking to generate purposive humanoid motions. The controller consists of the two layers: rhythmic walking is...
Masaki Ogino, Yutaka Katoh, Masahiro Aono, Minoru ...
ICSTM
2000
103views Management» more  ICSTM 2000»
15 years 5 months ago
The worst failure: repeated failure to learn
Performance measurement systems based on the principle that "if you can't measure it, you can't manage it" reinforce a short-term culture by focussing on tangi...
Alan C. McLucas
KESAMSTA
2007
Springer
15 years 10 months ago
Reinforcement Learning on a Futures Market Simulator
: In recent years, market forecasting by machine learning methods has been flourishing. Most existing works use a past market data set, because they assume that each trader’s in...
Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...
VLSID
2005
IEEE
105views VLSI» more  VLSID 2005»
15 years 9 months ago
Placement and Routing for 3D-FPGAs Using Reinforcement Learning and Support Vector Machines
The primary advantage of using 3D-FPGA over 2D-FPGA is that the vertical stacking of active layers reduce the Manhattan distance between the components in 3D-FPGA than when placed...
R. Manimegalai, E. Siva Soumya, V. Muralidharan, B...