Search Sciweavers | Sciweavers

567 search results - page 60 / 114

» Regularized Policy Iteration

231

click to vote

MP
2011

250views Intelligent Agents» more MP 2011»

An interior-point piecewise linear penalty method for nonlinear programming

14 years 6 months ago

Download www.corc.ieor.columbia.edu

We present an interior-point penalty method for nonlinear programming (NLP), where the merit function consists of a piecewise linear penalty function (PLPF) and an 2-penalty functi...

Lifeng Chen, Donald Goldfarb

claim paper

Read More »

155

click to vote

HPCN
1997
Springer

159views Distributed And Parallel Com...» more HPCN 1997»

Parallel Solution of Irregular, Sparse Matrix Problems Using High Performance Fortran

15 years 8 months ago

Download www.math.vt.edu

For regular, sparse, linear systems, like those derived from regular grids, using High Performance Fortran (HPF) for iterative solvers is straightforward. However, for irregular ma...

Eric de Sturler, Damian Loher

claim paper

Read More »

164

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 4 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

145

Voted

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 5 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

100

click to vote

SISAP
2008
IEEE

98views Data Mining» more SISAP 2008»

On Reinsertions in M-tree

15 years 10 months ago

Download siret.ms.mff.cuni.cz

In this paper we introduce a new M-tree building method, utilizing the classic idea of forced reinsertions. In case a leaf is about to split, some distant objects are removed from...

Jakub Lokoc, Tomás Skopal

claim paper

Read More »

« Prev « First page 60 / 114 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers