Search Sciweavers | Sciweavers

132 search results - page 26 / 27

» Generalization in Reinforcement Learning: Safely Approximati...

SAGA 2009, Springer (Control Systems, 183 views)

Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies

14 years 9 hours ago

Download www.montefiore.ulg.ac.be

We propose a generic method for obtaining quickly good upper bounds on the minimal value of a multistage stochastic program. The method is based on the simulation of a feasible dec...

Boris Defourny, Damien Ernst, Louis Wehenkel

JMLR 2008 (168 views)

Max-margin Classification of Data with Absent Features

13 years 5 months ago

Download ai.stanford.edu

We consider the problem of learning classifiers in structured domains, where some objects have a subset of features that are inherently absent due to complex relationships between...

Gal Chechik, Geremy Heitz, Gal Elidan, Pieter Abbe...

NIPS 1998 (Information Technology, 164 views)

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

13 years 6 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

ECML 2006, Springer (Machine Learning, 132 views)

Prioritizing Point-Based POMDP Solvers

13 years 9 months ago

Download www.cs.bgu.ac.il

Recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods such as PBVI, Perseus, and HSVI, which quickly converge to an approximate so...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

GECCO 2008, Springer (Optimization, 172 views)

Recursive least squares and quadratic prediction in continuous multistep problems

13 years 6 months ago

Download www.cs.bham.ac.uk

XCS with computed prediction, namely XCSF, has been recently extended in several ways. In particular, a novel prediction update algorithm based on recursive least squares and the ...

Daniele Loiacono, Pier Luca Lanzi
