Search Sciweavers | Sciweavers

1383 search results - page 117 / 277

» The Convergence of Iterated Classification

145

click to vote

COLT
2008
Springer

103views Machine Learning» more COLT 2008»

More Efficient Internal-Regret-Minimizing Algorithms

15 years 6 months ago

Download www.cs.brown.edu

Standard no-internal-regret (NIR) algorithms compute a fixed point of a matrix, and hence typically require O(n3 ) run time per round of learning, where n is the dimensionality of...

Amy R. Greenwald, Zheng Li, Warren Schudy

claim paper

Read More »

141

Voted

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 5 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

145

click to vote

CORR
2007
Springer

128views Education» more CORR 2007»

Equivalence of LP Relaxation and Max-Product for Weighted Matching in General Graphs

15 years 4 months ago

Download www.mit.edu

— Max-product belief propagation is a local, iterative algorithm to ﬁnd the mode/MAP estimate of a probability distribution. While it has been successfully employed in a wide v...

Sujay Sanghavi

claim paper

Read More »

150

click to vote

TSP
2008

126views more TSP 2008»

HOS-Based Semi-Blind Spatial Equalization for MIMO Rayleigh Fading Channels

15 years 4 months ago

Download www.staff.ncl.ac.uk

In this paper we concentrate on the direct semi-blind spatial equalizer design for MIMO systems with Rayleigh fading channels. Our aim is to develop an algorithm which can outperf...

Zhiguo Ding, Tharmalingam Ratnarajah, Colin Cowan

claim paper

Read More »

186

click to vote

AI
2002
Springer

171views Artificial Intelligence» more AI 2002»

Multiagent learning using a variable learning rate

15 years 4 months ago

Download www.cs.cmu.edu

Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 117 / 277 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers