Search Sciweavers | Sciweavers

355 search results - page 13 / 71

» Online Learning and Exploiting Relational Models in Reinforc...

187

click to vote

APPINF
2003

220views Information Technology» more APPINF 2003»

Evolving High-Dimensional, Adaptive Camera-based Speed Sensors

15 years 7 months ago

Download www.imd.uni-rostock.de

This paper reviews some attempts that exploit a phenomenon, also known as motion parallax, to estimate the distance of closest approach of a moving object. Despite their success, ...

Ralf Salomon

claim paper

Read More »

149

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 7 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

135

click to vote

AAAI
2008

105views Intelligent Agents» more AAAI 2008»

Potential-based Shaping in Model-based Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

Potential-based shaping was designed as a way of introducing background knowledge into model-free reinforcement-learning algorithms. By identifying states that are likely to have ...

John Asmuth, Michael L. Littman, Robert Zinkov

claim paper

Read More »

181

click to vote

WWW
2009
ACM

200views Internet Technology» more WWW 2009»

Learning to recognize reliable users and content in social media with coupled mutual reinforcement

16 years 6 months ago

Download www.mathcs.emory.edu

Community Question Answering (CQA) has emerged as a popular forum for users to pose questions for other users to answer. Over the last few years, CQA portals such as Naver and Yah...

Jiang Bian, Yandong Liu, Ding Zhou, Eugene Agichte...

claim paper

Read More »

172

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

15 years 7 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

« Prev « First page 13 / 71 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers