Search Sciweavers | Sciweavers

473 search results - page 95 / 95

» Optimal policy switching algorithms for reinforcement learni...

ROBOCUP
2007
Springer

99 views, Robotics

Instance-Based Action Models for Fast Action Planning

13 years 11 months ago

Download userweb.cs.utexas.edu

Abstract. Two main challenges of robot action planning in real domains are uncertain action effects and dynamic environments. In this paper, an instance-based action model is lear...

Mazda Ahmadi, Peter Stone

COLT
2008
Springer

179 views, Machine Learning

Adapting to a Changing Environment: the Brownian Restless Bandits

13 years 6 months ago

Download research.microsoft.com

In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...

Aleksandrs Slivkins, Eli Upfal

SIGECOM
2010
ACM

183 views, ECommerce

The unavailable candidate model: a decision-theoretic view of social choice

13 years 9 months ago

Download www.cs.toronto.edu

One of the fundamental problems in the theory of social choice is aggregating the rankings of a set of agents (or voters) into a consensus ranking. Rank aggregation has found appl...

Tyler Lu, Craig Boutilier
