Sciweavers

274 search results - page 15 / 55
» Network reinforcement
Sort
View
NN
2002
Springer
113views Neural Networks» more  NN 2002»
14 years 9 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
IANDO
2010
112views more  IANDO 2010»
14 years 4 months ago
Generative mechanisms for innovation in information infrastructures
This paper investigates how innovation of ICT based services takes place within existing infrastructures, including the whole network of technology, vendors and customers. Our res...
Bendik Bygstad
NIPS
1990
14 years 10 months ago
Planning with an Adaptive World Model
We present a new connectionist planning method TML90 . By interaction with an unknown environment, a world model is progressively constructed using gradient descent. For deriving ...
Sebastian Thrun, Knut Möller, Alexander Linde...
IAT
2008
IEEE
14 years 9 months ago
Scaling Up Multi-agent Reinforcement Learning in Complex Domains
TD-FALCON (Temporal Difference - Fusion Architecture for Learning, COgnition, and Navigation) is a class of self-organizing neural networks that incorporates Temporal Difference (...
Dan Xiao, Ah-Hwee Tan
68
Voted
CIKM
2000
Springer
15 years 1 months ago
Relevance and Reinforcement in Interactive Browsing
We consider the problem of browsing the top ranked portion of the documents returned by an information retrieval system. We describe an interactive relevance feedback agent that a...
Anton Leuski