Sciweavers

82 search results - page 4 / 17
» Balancing Exploration and Exploitation in Learning to Rank O...
Sort
View
EMNLP
2007
14 years 11 months ago
Incremental Text Structuring with Online Hierarchical Ranking
Many emerging applications require documents to be repeatedly updated. Such documents include newsfeeds, webpages, and shared community resources such as Wikipedia. In this paper ...
Erdong Chen, Benjamin Snyder, Regina Barzilay
99
Voted
NN
2002
Springer
113views Neural Networks» more  NN 2002»
14 years 10 months ago
Control of exploitation-exploration meta-parameter in reinforcement learning
In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...
Shin Ishii, Wako Yoshida, Junichiro Yoshimoto
69
Voted
GECCO
2007
Springer
143views Optimization» more  GECCO 2007»
15 years 4 months ago
Learning and exploiting knowledge in multi-agent task allocation problems
Imagine a group of cooperating agents attempting to allocate tasks amongst themselves without knowledge of their own capabilities. Over time, they develop a belief of their own sk...
Adam Campbell, Annie S. Wu
69
Voted
ICRA
2010
IEEE
150views Robotics» more  ICRA 2010»
14 years 8 months ago
Balancing state-space coverage in planning with dynamics
— Sampling-based kinodynamic planners, such as the popular RRT algorithm, have been proposed as promising solutions to planning for systems with dynamics. Nevertheless, complex s...
Yanbo Li, Kostas E. Bekris
89
Voted
ICML
2005
IEEE
15 years 11 months ago
Bayesian sparse sampling for on-line reward optimization
We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...
Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...