Search Sciweavers | Sciweavers

2683 search results - page 192 / 537

» Machine learning problems from optimization perspective

108

click to vote

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Internal Rewards Mitigate Agent Boundedness

15 years 7 months ago

Download www-personal.umich.edu

Abstract--Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals-however they came to be defined--while ignoring the rela...

Jonathan Sorg, Satinder P. Singh, Richard Lewis

claim paper

Read More »

136

click to vote

ICML
2003
IEEE

156views Machine Learning» more ICML 2003»

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon

16 years 7 months ago

Download www-2.cs.cmu.edu

A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

160

click to vote

COLT
2007
Springer

117views Machine Learning» more COLT 2007»

Resource-Bounded Information Gathering for Correlation Clustering

16 years 13 days ago

Download www.cs.umass.edu

We present a new class of problems, called resource-bounded information gathering for correlation clustering. Our goal is to perform correlation clustering under circumstances in w...

Pallika Kanani, Andrew McCallum

claim paper

Read More »

169

click to vote

NIPS
1993

134views Information Technology» more NIPS 1993»

Using Local Trajectory Optimizers to Speed Up Global Optimization in Dynamic Programming

15 years 7 months ago

Download www.cs.cmu.edu

Dynamic programming provides a methodology to develop planners and controllers for nonlinear systems. However, general dynamic programming is computationally intractable. We have ...

Christopher G. Atkeson

claim paper

Read More »

171

click to vote

ICML
2007
IEEE

166views Machine Learning» more ICML 2007»

Classifying matrices with a spectral regularization

16 years 7 months ago

Download www.machinelearning.org

We propose a method for the classification of matrices. We use a linear classifier with a novel regularization scheme based on the spectral 1-norm of its coefficient matrix. The s...

Ryota Tomioka, Kazuyuki Aihara

claim paper

Read More »

« Prev « First page 192 / 537 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers