Search Sciweavers | Sciweavers

159

NN
2002
Springer

113views Neural Networks» more NN 2002»

Control of exploitation-exploration meta-parameter in reinforcement learning

15 years 5 months ago

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

claim paper

Read More »

212

click to vote

IANDO
2010

112views more IANDO 2010»

Generative mechanisms for innovation in information infrastructures

15 years 11 days ago

Download is2.lse.ac.uk

This paper investigates how innovation of ICT based services takes place within existing infrastructures, including the whole network of technology, vendors and customers. Our res...

Bendik Bygstad

claim paper

Read More »

141

click to vote

NIPS
1990

102views Information Technology» more NIPS 1990»

Planning with an Adaptive World Model

15 years 6 months ago

Download www.ri.cmu.edu

We present a new connectionist planning method TML90 . By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving ...

Sebastian Thrun, Knut Möller, Alexander Linde...

claim paper

Read More »

163

click to vote

IAT
2008
IEEE

161views Intelligent Agents» more IAT 2008»

Scaling Up Multi-agent Reinforcement Learning in Complex Domains

15 years 5 months ago

Download www3.ntu.edu.sg

TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...

Dan Xiao, Ah-Hwee Tan

claim paper

Read More »

122

click to vote

CIKM
2000
Springer

104views Information Technology» more CIKM 2000»

Relevance and Reinforcement in Interactive Browsing

15 years 9 months ago

Download ciir.cs.umass.edu

We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...

Anton Leuski

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers