Search Sciweavers | Sciweavers

2683 search results - page 129 / 537

» Machine learning problems from optimization perspective

187

Voted

AAAI
1998

170views Intelligent Agents» more AAAI 1998»

The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems

15 years 8 months ago

Download opim.wharton.upenn.edu

Reinforcement learning can provide a robust and natural means for agents to learn how to coordinate their action choices in multiagent systems. We examine some of the factors that...

Caroline Claus, Craig Boutilier

claim paper

Read More »

202

click to vote

EUROCOLT
1999
Springer

119views Machine Learning» more EUROCOLT 1999»

Query by Committee, Linear Separation and Random Walks

15 years 11 months ago

Download www.cs.huji.ac.il

Abstract. Recent works have shown the advantage of using Active Learning methods, such as the Query by Committee (QBC) algorithm, to various learning problems. This class of Algori...

Ran Bachrach, Shai Fine, Eli Shamir

claim paper

Read More »

208

click to vote

ECML
2006
Springer

153views Machine Learning» more ECML 2006»

An Adaptive Kernel Method for Semi-supervised Clustering

15 years 10 months ago

Download cs.gmu.edu

Semi-supervised clustering uses the limited background knowledge to aid unsupervised clustering algorithms. Recently, a kernel method for semi-supervised clustering has been introd...

Bojun Yan, Carlotta Domeniconi

claim paper

Read More »

280

click to vote

SIGMOD
2007
ACM

197views Database» more SIGMOD 2007»

Automated and on-demand provisioning of virtual machines for database applications

16 years 7 months ago

Download www.cs.duke.edu

Utility computing delivers compute and storage resources to applications as an `on-demand utility', much like electricity, from a distributed collection of computing resource...

Piyush Shivam, Azbayar Demberel, Pradeep Gunda, Da...

claim paper

Read More »

151

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 11 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

« Prev « First page 129 / 537 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers