Search Sciweavers | Sciweavers

1177 search results - page 37 / 236

» Iterative methods for Robbins problems

115

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

14 years 9 months ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

105

click to vote

ICASSP
2008
IEEE

161views Signal Processing» more ICASSP 2008»

Discriminative training by iterative linear programming optimization

15 years 9 months ago

Download www.cs.ust.hk

In this paper, we cast discriminative training problems into standard linear programming (LP) optimization. Besides being convex and having globally optimal solution(s), LP progra...

Brian Mak, Benny Ng

claim paper

Read More »

118

click to vote

AAAI
2006

146views Intelligent Agents» more AAAI 2006»

Incremental Least Squares Policy Iteration for POMDPs

15 years 4 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

claim paper

Read More »

121

click to vote

ICML
2006
IEEE

125views Machine Learning» more ICML 2006»

Iterative RELIEF for feature weighting

16 years 3 months ago

Download plaza.ufl.edu

RELIEF is considered one of the most successful algorithms for assessing the quality of features. In this paper, we propose a set of new feature weighting algorithms that perform s...

Yijun Sun, Jian Li

claim paper

Read More »

119

Voted

GLOBECOM
2006
IEEE

120views Communications» more GLOBECOM 2006»

Hierarchical Iterative Algorithm for a Coupled Constrained OSNR Nash Game

15 years 9 months ago

Download www.control.utoronto.ca

— This paper develops a hierarchical iterative OSNR algorithm based on a game theory framework. A Nash game is formulated between channels with channel utility related to maximiz...

Lacra Pavel

claim paper

Read More »

« Prev « First page 37 / 236 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers