Search Sciweavers | Sciweavers

11 search results - page 2 / 3

» Reinforcement learning by reward-weighted regression for ope...

click to vote

HIS
2004

196views Information Technology» more HIS 2004»

Reinforcement Learning Hierarchical Neuro-Fuzzy Politree Model for Control of Autonomous Agents

13 years 6 months ago

Download ducati.doc.ntu.ac.uk

: This work presents a new hybrid neuro-fuzzy model for automatic learning of actions taken by agents. The main objective of this new model is to provide an agent with intelligence...

Karla Figueiredo, Marley B. R. Vellasco, Marco Aur...

claim paper

Read More »

click to vote

IROS
2008
IEEE

144views Robotics» more IROS 2008»

Learning nonparametric policies by imitation

13 years 11 months ago

Download www.cs.washington.edu

— A long cherished goal in artiﬁcial intelligence has been the ability to endow a robot with the capacity to learn and generalize skills from watching a human teacher. Such an ...

David B. Grimes, Rajesh P. N. Rao

claim paper

Read More »

click to vote

DAGSTUHL
2001

176views Software Engineering» more DAGSTUHL 2001»

Decision-Theoretic Control of Planetary Rovers

13 years 6 months ago

Download anytime.cs.umass.edu

Planetary rovers are small unmanned vehicles equipped with cameras and a variety of sensors used for scientific experiments. They must operate under tight constraints over such res...

Shlomo Zilberstein, Richard Washington, Daniel S. ...

claim paper

Read More »

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

14 years 5 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

click to vote

GECCO
2006
Springer

177views Optimization» more GECCO 2006»

Hyper-ellipsoidal conditions in XCS: rotation, linear approximation, and solution structure

13 years 8 months ago

Download www.eskimo.com

The learning classifier system XCS is an iterative rulelearning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classifi...

Martin V. Butz, Pier Luca Lanzi, Stewart W. Wilson

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers