Search Sciweavers | Sciweavers

340 search results - page 62 / 68

» Kernelized value function approximation for reinforcement le...


ATAL
2004
Springer

97 views, Intelligent Agents

Unifying Temporal and Structural Credit Assignment Problems

16 years 14 days ago

Download ti.arc.nasa.gov

Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...

Adrian K. Agogino, Kagan Tumer



ML
2008
ACM

248 views, Machine Learning

Feature selection via sensitivity analysis of SVM probabilistic outputs

15 years 7 months ago

Download guppy.mpe.nus.edu.sg

Feature selection is an important aspect of solving data-mining and machine-learning problems. This paper proposes a feature-selection method for the Support Vector Machine (SVM) l...

Kai Quan Shen, Chong Jin Ong, Xiao Ping Li, Einar ...



SAGA
2009
Springer

183 views, Control Systems

Bounds for Multistage Stochastic Programs Using Supervised Learning Strategies

16 years 1 months ago

Download www.montefiore.ulg.ac.be

We propose a generic method for obtaining quickly good upper bounds on the minimal value of a multistage stochastic program. The method is based on the simulation of a feasible dec...

Boris Defourny, Damien Ernst, Louis Wehenkel



SCALESPACE
2005
Springer

111 views, Computer Vision

Vortex and Source Particles for Fluid Motion Estimation

16 years 17 days ago

Download www-labsticc.univ-ubs.fr

In this paper we propose a new motion estimator for image sequences depicting fluid flows. The proposed estimator is based on the Helmholtz decomposition of vector fields. This ...

Anne Cuzol, Étienne Mémin



ICML
2006
IEEE

143 views, Machine Learning

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 8 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

