Search Sciweavers | Sciweavers

121 search results - page 16 / 25

» Learning Decision Theoretic Utilities through Reinforcement ...

144

click to vote

ECCV
2010
Springer

251views Computer Vision» more ECCV 2010»

Discriminative Tracking by Metric Learning

15 years 3 months ago

Download www.eecs.northwestern.edu

We present a discriminative model that casts appearance modeling and visual matching into a single objective for visual tracking. Most previous discriminative models for visual tra...

claim paper

Read More »

136

Voted

AAMAS
2005
Springer

174views Intelligent Agents» more AAMAS 2005»

Cooperative Multi-Agent Learning: The State of the Art

14 years 11 months ago

Download cs.gmu.edu

Cooperative multi-agent systems are ones in which several agents attempt, through their interaction, to jointly solve tasks or to maximize utility. Due to the interactions among t...

Liviu Panait, Sean Luke

claim paper

Read More »

click to vote

ECAI
2000
Springer

102views Artificial Intelligence» more ECAI 2000»

Learning to Use Operational Advice

15 years 4 months ago

Download home.in.tum.de

We address the problem of advice-taking in a given domain, in particular for building a game-playing program. Our approach to solving it strives for the application of machine lea...

Johannes Fürnkranz, Bernhard Pfahringer, Herm...

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 5 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

AR
2002

157views more AR 2002»

Acquiring state from control dynamics to learn grasping policies for robot hands

14 years 11 months ago

Download www.mit.edu

Abstract--A prominent emerging theory of sensorimotor development in biological systems proposes that control knowledge is encoded in the dynamics of physical interaction with the ...

Roderic A. Grupen, Jefferson A. Coelho Jr.

claim paper

Read More »

« Prev « First page 16 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers