Sciweavers

2436 search results - page 175 / 488
» Evaluating Adaptive Problem Selection
Sort
View
160
Voted
ICML
2000
IEEE
16 years 4 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
131
Voted
TSMC
2011
258views more  TSMC 2011»
14 years 10 months ago
Cross-Entropy Optimization of Control Policies With Adaptive Basis Functions
—This paper introduces an algorithm for direct search of control policies in continuous-state discrete-action Markov decision processes. The algorithm looks for the best closed-l...
Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...
152
Voted
KDD
2009
ACM
230views Data Mining» more  KDD 2009»
16 years 4 months ago
Cross domain distribution adaptation via kernel mapping
When labeled examples are limited and difficult to obtain, transfer learning employs knowledge from a source domain to improve learning accuracy in the target domain. However, the...
ErHeng Zhong, Wei Fan, Jing Peng, Kun Zhang, Jiang...
116
Voted
HICSS
2002
IEEE
115views Biometrics» more  HICSS 2002»
15 years 8 months ago
Fuzzy Rules for HTML Transcoding
With the increasing availability of Web-enabled mobile devices, we are facing the problem to effectively adapt Web content for those devices. For adaptation, Web page structures r...
Robbie Schaefer, Andreas Dangberg, Wolfgang Mü...
PRL
2010
181views more  PRL 2010»
15 years 2 months ago
Gait recognition without subject cooperation
The strength of gait, compared to other biometrics, is that it does not require cooperative subjects. Previoius gait recognition approaches were evaluated using a gallery set cons...
Khalid Bashir, Tao Xiang, Shaogang Gong