Search Sciweavers | Sciweavers

1235 search results - page 56 / 247

» ABC Reinforcement Learning

207

click to vote

JDCTA
2010

160views more JDCTA 2010»

Learning and Decision Making in Human During a Game of Matching Pennies

15 years 1 months ago

Download www.aicit.org

To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...

Jianfeng Hu, Xiaofeng Li, Jinghai Yin

claim paper

Read More »

152

click to vote

PDPTA
2003

110views Distributed And Parallel Com...» more PDPTA 2003»

Java Resources for Teaching Reinforcement Learning

15 years 8 months ago

Download cs.gettysburg.edu

— In this paper we present a library of classes for programming reinforcement learning simulations in Java. This library is based upon the standard by Sutton and Santamaria [1], ...

Amy J. Kerr, Todd W. Neller, Christopher J. La Pil...

claim paper

Read More »

208

click to vote

AGI
2011

222views Artificial Intelligence» more AGI 2011»

Measuring Agent Intelligence via Hierarchies of Environments

14 years 10 months ago

Download www.ssec.wisc.edu

Under Legg’s and Hutter’s formal measure [1], performance in easy environments counts more toward an agent’s intelligence than does performance in difficult environments. An ...

Bill Hibbard

claim paper

Read More »

227

click to vote

CORR
2006
Springer

140views Education» more CORR 2006»

Nearly optimal exploration-exploitation decision thresholds

15 years 7 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

Read More »

181

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 5 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

« Prev « First page 56 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers