Search Sciweavers | Sciweavers

51 search results - page 4 / 11

ICRA
2009
IEEE

143 views · Robotics

Least absolute policy iteration for robust value function approximation

15 years 6 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract: Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

ECML
2004
Springer

100 views · Machine Learning

Dynamic Asset Allocation Exploiting Predictors in Reinforcement Learning Framework

15 years 5 months ago

Download bi.snu.ac.kr

Given the pattern-based multi-predictors of the stock price, we study a method of dynamic asset allocation to maximize the trading performance. To optimize the proportion of asset ...

Jangmin O, Jae Won Lee, Jongwoo Lee, Byoung-Tak Zh...

ECML
2004
Springer

139 views · Machine Learning

Batch Reinforcement Learning with State Importance

15 years 5 months ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

ATAL
2009
Springer

167 views · Intelligent Agents

Solving multiagent assignment Markov decision processes

15 years 6 months ago

Download www.aamas-conference.org

We consider the setting of multiple collaborative agents trying to complete a set of tasks as assigned by a centralized controller. We propose a scalable method called “Assignmen...

Scott Proper, Prasad Tadepalli

AR
2007

105 views

Reinforcement learning of a continuous motor sequence with hidden states

14 years 11 months ago

Download www.bdc.brain.riken.go.jp

Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...
