Sciweavers

1227 search results - page 149 / 246
» Learning Rates for Q-Learning
Sort
View
CIKM
2009
Springer
15 years 4 months ago
A general magnitude-preserving boosting algorithm for search ranking
Traditional boosting algorithms for the ranking problems usually employ the pairwise approach and convert the document rating preference into a binary-value label, like RankBoost....
Chenguang Zhu, Weizhu Chen, Zeyuan Allen Zhu, Gang...
CEC
2007
IEEE
15 years 4 months ago
Evolving the best-response strategy to decide when to make a proposal
— This paper designed and developed negotiation agents with the distinguishing features of 1) conducting continuous time negotiation rather than discrete time negotiation, 2) lea...
Bo An, Kwang Mong Sim, Victor R. Lesser
IUI
2000
ACM
15 years 2 months ago
A perceptual assistant to do sound equalization
This paper describes an intelligent interface to assist in the expert perceptual task of sound equalization. This is commonly done by a sound engineer in a recording studio, live ...
Dale Reed
IWCLS
1999
Springer
15 years 2 months ago
An Adaptive Agent Based Economic Model
In this paper we describe a simple model of adaptive agents of different types, represented by Learning Classifier Systems (LCS), which make investment decisions about a risk fre...
Sonia Schulenburg, Peter Ross
COLT
1995
Springer
15 years 1 months ago
A Comparison of New and Old Algorithms for a Mixture Estimation Problem
We investigate the problem of estimating the proportion vector which maximizes the likelihood of a given sample for a mixture of given densities. We adapt a framework developed for...
David P. Helmbold, Yoram Singer, Robert E. Schapir...