Search Sciweavers | Sciweavers

1364 search results - page 187 / 273

» Sampling Methods for Unsupervised Learning

143

click to vote

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 3 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

claim paper

Read More »

150

click to vote

ECCV
2004
Springer

192views Computer Vision» more ECCV 2004»

A Constrained Semi-supervised Learning Approach to Data Association

16 years 6 months ago

Download people.cs.ubc.ca

Data association (obtaining correspondences) is a ubiquitous problem in computer vision. It appears when matching image features across multiple images, matching image features to ...

Hendrik Kück, Nando de Freitas, Peter Carbone...

claim paper

Read More »

153

click to vote

ML
2006
ACM

113views Machine Learning» more ML 2006»

Learning to bid in bridge

15 years 4 months ago

Download www.cs.technion.ac.il

Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In additio...

Asaf Amit, Shaul Markovitch

claim paper

Read More »

151

click to vote

WSDM
2012
ACM

214views Data Mining» more WSDM 2012»

Selecting actions for resource-bounded information extraction using reinforcement learning

14 years 7 days ago

Download people.cs.umass.edu

Given a database with missing or uncertain content, our goal is to correct and ﬁll the database by extracting speciﬁc information from a large corpus such as the Web, and to d...

Pallika H. Kanani, Andrew K. McCallum

claim paper

Read More »

145

click to vote

ICML
2004
IEEE

207views Machine Learning» more ICML 2004»

Bayesian inference for transductive learning of kernel matrix using the Tanner-Wong data augmentation algorithm

16 years 5 months ago

Download www.cs.ust.hk

In kernel methods, an interesting recent development seeks to learn a good kernel from empirical data automatically. In this paper, by regarding the transductive learning of the k...

Zhihua Zhang, Dit-Yan Yeung, James T. Kwok

claim paper

Read More »

« Prev « First page 187 / 273 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers