Sciweavers

2071 search results - page 146 / 415
» An Empirical Evaluation of LFG-DOP
Sort
View
ICML
2008
IEEE
16 years 2 months ago
Exploration scavenging
We examine the problem of evaluating a policy in the contextual bandit setting using only observations collected during the execution of another policy. We show that policy evalua...
John Langford, Alexander L. Strehl, Jennifer Wortm...
ICCAD
2007
IEEE
103views Hardware» more  ICCAD 2007»
15 years 10 months ago
Enhancing design robustness with reliability-aware resynthesis and logic simulation
While circuit density and power efficiency increase with each major advance in IC technology, reliability with respect to soft errors tends to decrease. Current solutions to this...
Smita Krishnaswamy, Stephen Plaza, Igor L. Markov,...
GFKL
2007
Springer
196views Data Mining» more  GFKL 2007»
15 years 7 months ago
Comparison of Recommender System Algorithms Focusing on the New-item and User-bias Problem
Recommender systems are used by an increasing number of e-commerce websites to help the customers to find suitable products from a large database. One of the most popular techniqu...
Stefan Hauger, Karen H. L. Tso, Lars Schmidt-Thiem...
ATAL
2008
Springer
15 years 3 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...
ACL
2010
14 years 11 months ago
Coreference Resolution with Reconcile
Despite the existence of several noun phrase coreference resolution data sets as well as several formal evaluations on the task, it remains frustratingly difficult to compare resu...
Veselin Stoyanov, Claire Cardie, Nathan Gilbert, E...