Search Sciweavers | Sciweavers

24 search results - page 2 / 5

» Learning Policy Improvements with Path Integrals

click to vote

SOSP
2007
ACM

138views Operating System» more SOSP 2007»

Improving file system reliability with I/O shepherding

14 years 2 months ago

Download www.cs.wisc.edu

We introduce a new reliability infrastructure for ﬁle systems called I/O shepherding. I/O shepherding allows a ﬁle system developer to craft nuanced reliability policies to de...

Haryadi S. Gunawi, Vijayan Prabhakaran, Swetha Kri...

claim paper

Read More »

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

13 years 6 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

click to vote

LCTRTS
2007
Springer

180views System Software» more LCTRTS 2007»

Integrated CPU and l2 cache voltage scaling using machine learning

13 years 11 months ago

Download www.cs.pitt.edu

Embedded systems serve an emerging and diverse set of applications. As a result, more computational and storage capabilities are added to accommodate ever more demanding applicati...

Nevine AbouGhazaleh, Alexandre Ferreira, Cosmin Ru...

claim paper

Read More »

click to vote

IJCAI
2007

135views Artificial Intelligence» more IJCAI 2007»

Using Learned Policies in Heuristic-Search Planning

13 years 6 months ago

Download www2.parc.com

Many current state-of-the-art planners rely on forward heuristic search. The success of such search typically depends on heuristic distance-to-the-goal estimates derived from the ...

Sung Wook Yoon, Alan Fern, Robert Givan

claim paper

Read More »

click to vote

ECAI
2006
Springer

89views Artificial Intelligence» more ECAI 2006»

Learning by Automatic Option Discovery from Conditionally Terminating Sequences

13 years 9 months ago

Download www.ceng.metu.edu.tr

Abstract. This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...

Sertan Girgin, Faruk Polat, Reda Alhajj

claim paper

Read More »

« Prev « First page 2 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers