policy | Sciweavers

34

IJCAI
2003

147views Artificial Intelligence» more IJCAI 2003»

Approximate Policy Iteration using Large-Margin Classifiers

13 years 10 months ago

We present an approximate policy iteration algorithm that uses rollouts to estimate the value of each action under a given policy in a subset of states and a classifier to general...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

26

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

13 years 10 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

23

click to vote

SEC
2008

105views Security Privacy» more SEC 2008»

Negotiation of Prohibition: An Approach Based on Policy Rewriting

13 years 10 months ago

Download www.orbac.org

Abstract. In recent security architectures, it is possible that the security policy is not evaluated in a centralized way but requires negotiation between the subject who is reques...

Nora Cuppens-Boulahia, Frédéric Cupp...

claim paper

Read More »

27

click to vote

IM
2007

82views Computer Networks» more IM 2007»

Issues in Designing a Policy Language for Distributed Management of IT Infrastructures

13 years 10 months ago

Download domino.research.ibm.com

— The objectives of this paper are twofold. First, we introduce a novel policy language, called CIM-SPL (Simple Policy Language for CIM) that complies with the CIM (Common Inform...

Dakshi Agrawal, Seraphin B. Calo, Kang-Won Lee, Jo...

claim paper

Read More »

30

click to vote

DBSEC
2010

126views Database» more DBSEC 2010»

Mining Likely Properties of Access Control Policies via Association Rule Mining

13 years 10 months ago

Download people.engr.ncsu.edu

Abstract. Access control mechanisms are used to control which principals (such as users or processes) have access to which resources based on access control policies. To ensure the...

JeeHyun Hwang, Tao Xie, Vincent C. Hu, Mine Altuna...

claim paper

Read More »

34

click to vote

AAAI
2010

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

13 years 10 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

claim paper

Read More »

34

click to vote

CCS
2008
ACM

140views Security Privacy» more CCS 2008»

User-controllable learning of security and privacy policies

13 years 11 months ago

Download patrickgagekelley.com

Studies have shown that users have great difficulty specifying their security and privacy policies in a variety of application domains. While machine learning techniques have succ...

Patrick Gage Kelley, Paul Hankes Drielsma, Norman ...

claim paper

Read More »

26

click to vote

ATAL
2008
Springer

123views Intelligent Agents» more ATAL 2008»

Sigma point policy iteration

13 years 11 months ago

Download web.mit.edu

In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...

Michael H. Bowling, Alborz Geramifard, David Winga...

claim paper

Read More »

33

click to vote

WSC
2007

137views Modeling And Simulation» more WSC 2007»

A hybrid inventory control system approach applied to the food industry

13 years 11 months ago

Download www.informs-sim.org

The appropriate production and inventory control policy is a key factor for modern enterprises’ success in competitive environment. In the food industry, most of food manufactur...

David Claudio, Jie Zhang, Ying Zhang

claim paper

Read More »

36

click to vote

AIPS
2010

174views Artificial Intelligence» more AIPS 2010»

When Policies Can Be Trusted: Analyzing a Criteria to Identify Optimal Policies in MDPs with Unknown Model Parameters

13 years 11 months ago

Download www.cs.berkeley.edu

Computing a good policy in stochastic uncertain environments with unknown dynamics and reward model parameters is a challenging task. In a number of domains, ranging from space ro...

Emma Brunskill

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers