Search Sciweavers | Sciweavers

233 search results - page 7 / 47

» Composing and combining policies under the policy machine

Voted

ECML
2006
Springer

141views Machine Learning» more ECML 2006»

Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks

15 years 3 months ago

Download www.montefiore.ulg.ac.be

Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...

Sébastien Jodogne, Cyril Briquet, Justus H....

claim paper

Read More »

Voted

HPDC
2000
IEEE

152views Distributed And Parallel Com...» more HPDC 2000»

Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness

15 years 4 months ago

Download reports-archive.adm.cs.cmu.edu

While the MPP is still the most common architecture in supercomputer centers today, a simpler and cheaper machine conﬁguration is growing increasingly common. This alternative s...

Bianca Schroeder, Mor Harchol-Balter

claim paper

Read More »

Voted

POLICY
2004
Springer

84views Computer Networks» more POLICY 2004»

A Decentralized Treatment of a Highly Distributed Chinese-Wall Policy

15 years 5 months ago

Download www.cs.rutgers.edu

Access control (AC) technology has come a long way from its roots as the means for sharing resources between processes running on a single machine, to a mechanism for regulating t...

Naftaly H. Minsky

claim paper

Read More »

click to vote

CSFW
2010
IEEE

187views Security Privacy» more CSFW 2010»

Towards Quantitative Analysis of Proofs of Authorization: Applications, Framework, and Techniques

15 years 3 months ago

Download www.cs.pitt.edu

—Although policy compliance testing is generally treated as a binary decision problem, the evidence gathered during the trust management process can actually be used to examine t...

Adam J. Lee, Ting Yu

claim paper

Read More »

Voted

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 5 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 7 / 47 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers