Sciweavers

3050 search results - page 92 / 610
» On-line Algorithms in Machine Learning
Sort
View
171
Voted
EMNLP
2011
14 years 3 months ago
Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation
We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...
Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...
97
Voted
ICML
2007
IEEE
16 years 4 months ago
Automatic shaping and decomposition of reward functions
This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begi...
Bhaskara Marthi
130
Voted
SEAL
1998
Springer
15 years 7 months ago
Using Genetic Algorithms to Simulate the Evolution of an Oligopoly Game
Abstract. This paper extends the N-person IPD game into a more interesting game in economics, namely, the oligopoly game. Due to its market share dynamics, the oligopoly game is mo...
Shu-Heng Chen, Chih-Chi Ni
139
Voted
JCST
2006
128views more  JCST 2006»
15 years 3 months ago
Multi-Instance Learning from Supervised View
Abstract In multi-instance learning, the training set comprises labeled bags that are composed of unlabeled instances, and the task is to predict the labels of unseen bags. This pa...
Zhi-Hua Zhou
132
Voted
ML
2002
ACM
154views Machine Learning» more  ML 2002»
15 years 3 months ago
Technical Update: Least-Squares Temporal Difference Learning
TD() is a popular family of algorithms for approximate policy evaluation in large MDPs. TD() works by incrementally updating the value function after each observed transition. It h...
Justin A. Boyan