Search Sciweavers | Sciweavers

1474 search results - page 86 / 295

» Using Machine Learning to Focus Iterative Optimization

INFOCOM
2007
IEEE

96views Communications» more INFOCOM 2007»

Toward Tractable Computation of the Capacity of Multi-Hop Wireless Networks

16 years 15 days ago

Download www.eecis.udel.edu

By posing the problem of bandwidth allocation as a constrained maximization problem, it is possible to study various features of optimal bandwidth allocation, and hence the cap...

Stephan Bohacek, Peng Wang

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

CEAS
2006
Springer

147views Internet Technology» more CEAS 2006»

Fast Uncertainty Sampling for Labeling Large E-mail Corpora

15 years 10 months ago

Download www.ceas.cc

One of the biggest challenges in building effective anti-spam solutions is designing systems to defend against the ever-evolving bag of tricks spammers use to defeat them. Because ...

Richard Segal, Ted Markowitz, William Arnold

KDD
2002
ACM

169views Data Mining» more KDD 2002»

Optimizing search engines using clickthrough data

16 years 6 months ago

Download www.cs.cornell.edu

This paper presents an approach to automatically optimizing the retrieval quality of search engines using clickthrough data. Intuitively, a good information retrieval system shoul...

Thorsten Joachims

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

15 years 7 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
