Search Sciweavers | Sciweavers

82 search results - page 4 / 17

» Balancing Exploration and Exploitation in Learning to Rank O...

EMNLP
2007

140 views, Natural Language Processing

Incremental Text Structuring with Online Hierarchical Ranking

14 years 11 months ago

Download people.csail.mit.edu

Many emerging applications require documents to be repeatedly updated. Such documents include newsfeeds, webpages, and shared community resources such as Wikipedia. In this paper ...

Erdong Chen, Benjamin Snyder, Regina Barzilay


NN
2002
Springer

113 views, Neural Networks

Control of exploitation-exploration meta-parameter in reinforcement learning

14 years 10 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto


GECCO
2007
Springer

143 views, Optimization

Learning and exploiting knowledge in multi-agent task allocation problems

15 years 4 months ago

Download www.cs.bham.ac.uk

Imagine a group of cooperating agents attempting to allocate tasks amongst themselves without knowledge of their own capabilities. Over time, they develop a belief of their own sk...

Adam Campbell, Annie S. Wu


ICRA
2010
IEEE

150 views, Robotics

Balancing state-space coverage in planning with dynamics

14 years 8 months ago

Download www.cse.unr.edu

Sampling-based kinodynamic planners, such as the popular RRT algorithm, have been proposed as promising solutions to planning for systems with dynamics. Nevertheless, complex s...

Yanbo Li, Kostas E. Bekris


ICML
2005
IEEE

196 views, Machine Learning

Bayesian sparse sampling for on-line reward optimization

15 years 11 months ago

Download www.cs.ualberta.ca

We present an efficient "sparse sampling" technique for approximating Bayes optimal decision making in reinforcement learning, addressing the well known exploration vers...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...
