Search Sciweavers | Sciweavers

1110 search results - page 104 / 222

» New Approximation Results for Resource Replication Problems

110

Voted

ICRA
2006
IEEE

93views Robotics» more ICRA 2006»

A SVM-based Method for Engine Maintenance Strategy Optimization

15 years 11 months ago

Download www.cfins.au.tsinghua.edu.cn

— Due to the abundant application background, the optimization of maintenance problem has been extensively studied in the past decades. Besides the well-known difﬁculty of larg...

Qing-Shan Jia, Qianchuan Zhao

claim paper

Read More »

133

click to vote

JDA
2011

90views more JDA 2011»

Pattern matching in pseudo real-time

14 years 12 months ago

Download www.cs.bris.ac.uk

It has recently been shown how to construct online, non-amortised approximate pattern matching algorithms for a class of problems whose distance functions can be classiﬁed as be...

Raphaël Clifford, Benjamin Sach

claim paper

Read More »

148

click to vote

WWW
2007
ACM

148views Internet Technology» more WWW 2007»

A scalable application placement controller for enterprise data centers

16 years 5 months ago

Download www2007.org

Given a set of machines and a set of Web applications with dynamically changing demands, an online application placement controller decides how many instances to run for each appl...

Chunqiang Tang, Malgorzata Steinder, Mike Spreitze...

claim paper

Read More »

176

click to vote

AAAI
2012

205views Intelligent Agents» more AAAI 2012»

Kernel-Based Reinforcement Learning on Representative States

13 years 7 months ago

Download www.bkveton.com

Markov decision processes (MDPs) are an established framework for solving sequential decision-making problems under uncertainty. In this work, we propose a new method for batchmod...

Branislav Kveton, Georgios Theocharous

claim paper

Read More »

167

Voted

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

15 years 6 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

« Prev « First page 104 / 222 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers