Search Sciweavers | Sciweavers

233 search results - page 8 / 47

» Composing and combining policies under the policy machine

ICML
2003
IEEE

151 views, Machine Learning

Hierarchical Policy Gradient Algorithms

16 years 14 days ago

Download www.hpl.hp.com

Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...

Mohammad Ghavamzadeh, Sridhar Mahadevan

ICML
2009
IEEE

194 views, Machine Learning

Binary action search for learning continuous-action control policies

16 years 14 days ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

ICAC
2008
IEEE

123 views, Applied Computing

Generating Adaptation Policies for Multi-tier Applications in Consolidated Server Environments

15 years 6 months ago

Download www.cc.gatech.edu

Creating good adaptation policies is critical to building complex autonomic systems since it is such policies that define the system configuration used in any given situation. W...

Gueyoung Jung, Kaustubh R. Joshi, Matti A. Hiltune...

ICML
1995
IEEE

213 views, Machine Learning

Learning Policies for Partially Observable Environments: Scaling Up

16 years 14 days ago

Download reference.kfupm.edu.sa

Partially observable Markov decision processes (POMDPs) model decision problems in which an agent tries to maximize its reward in the face of limited and/or noisy sensor fee...

Michael L. Littman, Anthony R. Cassandra, Leslie P...

AAAI
2011

122 views, Intelligent Agents

User-Controllable Learning of Location Privacy Policies With Gaussian Mixture Models

13 years 11 months ago

Download www.andrew.cmu.edu

With smart-phones becoming increasingly commonplace, there has been a subsequent surge in applications that continuously track the location of users. However, serious privacy conc...

Justin Cranshaw, Jonathan Mugan, Norman M. Sadeh
