Search Sciweavers | Sciweavers

453 search results - page 52 / 91

» Learning from actions not taken: a multiagent learning algor...

101

click to vote

KCAP
2009
ACM

171views Information Technology» more KCAP 2009»

Interactively shaping agents via human reinforcement: the TAMER framework

15 years 6 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or ﬁnancial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

claim paper

Read More »

112

click to vote

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 5 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

click to vote

IJRR
2010

107views more IJRR 2010»

Non-parametric Learning to Aid Path Planning over Slopes

14 years 10 months ago

Download www.roboticsproceedings.org

— This paper addresses the problem of closing the loop from perception to action selection for unmanned ground vehicles, with a focus on navigating slopes. A new non-parametric l...

Sisir Karumanchi, Thomas Allen, Tim Bailey, Steve ...

claim paper

Read More »

click to vote

COLT
2008
Springer

140views Machine Learning» more COLT 2008»

Regret Bounds for Sleeping Experts and Bandits

15 years 1 months ago

Download colt2008.cs.helsinki.fi

We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...

Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...

claim paper

Read More »

124

click to vote

PODC
2009
ACM

259views Distributed and Parallel Com...» more PODC 2009»

Load balancing without regret in the bulletin board model

16 years 11 days ago

Download www.cs.cornell.edu

We analyze the performance of protocols for load balancing in distributed systems based on no-regret algorithms from online learning theory. These protocols treat load balancing a...

Éva Tardos, Georgios Piliouras, Robert D. K...

claim paper

Read More »

« Prev « First page 52 / 91 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers