Search Sciweavers | Sciweavers

82 search results - page 2 / 17

» Balancing Exploration and Exploitation in Learning to Rank O...

GECCO
2006
Springer

133 views, Optimization

On-line evolutionary computation for reinforcement learning in stochastic domains

15 years 1 month ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

PKDD
2010
Springer

179 views, Data Mining

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

14 years 8 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

ICML
2005
IEEE

133 views, Machine Learning

A theoretical analysis of Model-Based Interval Estimation

15 years 11 months ago

Download paul.rutgers.edu

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

EMNLP
2009

162 views, Natural Language Processing

Empirical Exploitation of Click Data for Task Specific Ranking

14 years 8 months ago

Download www.aclweb.org

There have been increasing needs for task specific rankings in web search such as rankings for specific query segments like long queries, time-sensitive queries, navigational quer...

Anlei Dong, Yi Chang, Shihao Ji, Ciya Liao, Xin Li...

CVPR
2012
IEEE

358 views, Computer Vision

Stream-based Joint Exploration-Exploitation Active Learning

13 years 3 months ago

Download www.eecs.qmul.ac.uk

Learning from streams of evolving and unbounded data is an important problem, for example in visual surveillance or internet scale data. For such large and evolving real-world data...

Chen Change Loy, Timothy M. Hospedales, Tao Xiang,...

posted by ccloy
