Search Sciweavers | Sciweavers

1455 search results - page 35 / 291

» Exploiting Myopic Learning

141

click to vote

IUI
2006
ACM

165views Software Engineering» more IUI 2006»

Who's asking for help?: a Bayesian approach to intelligent assistance

15 years 11 months ago

Download www.cs.toronto.edu

Automated software customization is drawing increasing attention as a means to help users deal with the scope, complexity, potential intrusiveness, and ever-changing nature of mod...

Bowen Hui, Craig Boutilier

claim paper

Read More »

153

click to vote

JACM
2006

93views more JACM 2006»

Combining expert advice in reactive environments

15 years 5 months ago

Download web.mit.edu

"Experts algorithms" constitute a methodology for choosing actions repeatedly, when the rewards depend both on the choice of action and on the unknown current state of t...

Daniela Pucci de Farias, Nimrod Megiddo

claim paper

Read More »

157

click to vote

JSAC
2007

189views more JSAC 2007»

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

15 years 4 months ago

Download www.cs.ust.hk

— One of the distinctive features in a wireless ad hoc network is lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

claim paper

Read More »

155

click to vote

CEC
2010
IEEE

216views Artificial Intelligence» more CEC 2010»

Coevolutionary Temporal Difference Learning for small-board Go

15 years 5 months ago

Download www.cs.put.poznan.pl

—In this paper we apply Coevolutionary Temporal Difference Learning (CTDL), a hybrid of coevolutionary search and reinforcement learning proposed in our former study, to evolve s...

Krzysztof Krawiec, Marcin Szubert

posted by mszubert

Read More »

204

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 22 days ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

« Prev « First page 35 / 291 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers