Search Sciweavers | Sciweavers

2011 search results - page 78 / 403

» Universal Reinforcement Learning

151

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 5 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

139

click to vote

AR
2004

84views more AR 2004»

Reinforcement learning of humanoid rhythmic walking parameters based on visual information

15 years 3 months ago

Download www.er.ams.eng.osaka-u.ac.jp

This paper presents a method for learning the parameters of rhythmic walking to generate purposive humanoid motions. The controller consists of the two layers: rhythmic walking is...

Masaki Ogino, Yutaka Katoh, Masahiro Aono, Minoru ...

claim paper

Read More »

228

click to vote

ICSTM
2000

103views Management» more ICSTM 2000»

The worst failure: repeated failure to learn

15 years 5 months ago

Download www.aes.asn.au

Performance measurement systems based on the principle that "if you can't measure it, you can't manage it" reinforce a short-term culture by focussing on tangi...

Alan C. McLucas

claim paper

Read More »

111

click to vote

KESAMSTA
2007
Springer

129views Intelligent Agents» more KESAMSTA 2007»

Reinforcement Learning on a Futures Market Simulator

15 years 10 months ago

Download www.jucs.org

: In recent years, market forecasting by machine learning methods has been ﬂourishing. Most existing works use a past market data set, because they assume that each trader’s in...

Koichi Moriyama, Mitsuhiro Matsumoto, Ken-ichi Fuk...

claim paper

Read More »

136

click to vote

VLSID
2005
IEEE

105views VLSI» more VLSID 2005»

Placement and Routing for 3D-FPGAs Using Reinforcement Learning and Support Vector Machines

15 years 9 months ago

Download www.cse.psu.edu

The primary advantage of using 3D-FPGA over 2D-FPGA is that the vertical stacking of active layers reduce the Manhattan distance between the components in 3D-FPGA than when placed...

R. Manimegalai, E. Siva Soumya, V. Muralidharan, B...

claim paper

Read More »

« Prev « First page 78 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers