Search Sciweavers | Sciweavers

3050 search results - page 92 / 610

» On-line Algorithms in Machine Learning

171

Voted

EMNLP
2011

164views Natural Language Processing» more EMNLP 2011»

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

14 years 3 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

claim paper

Read More »

Voted

ICML
2007
IEEE

162views Machine Learning» more ICML 2007»

Automatic shaping and decomposition of reward functions

16 years 4 months ago

Download www.machinelearning.org

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...

Bhaskara Marthi

claim paper

Read More »

130

Voted

SEAL
1998
Springer

137views Machine Learning» more SEAL 1998»

Using Genetic Algorithms to Simulate the Evolution of an Oligopoly Game

15 years 7 months ago

Download www.aiecon.org

Abstract. This paper extends the N-person IPD game into a more interesting game in economics, namely, the oligopoly game. Due to its market share dynamics, the oligopoly game is mo...

Shu-Heng Chen, Chih-Chi Ni

claim paper

Read More »

139

Voted

JCST
2006

128views more JCST 2006»

Multi-Instance Learning from Supervised View

15 years 3 months ago

Download cs.nju.edu.cn

Abstract In multi-instance learning, the training set comprises labeled bags that are composed of unlabeled instances, and the task is to predict the labels of unseen bags. This pa...

Zhi-Hua Zhou

claim paper

Read More »

132

Voted

ML
2002
ACM

154views Machine Learning» more ML 2002»

Technical Update: Least-Squares Temporal Difference Learning

15 years 3 months ago

Download www.research.rutgers.edu

TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 92 / 610 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers