Sciweavers

966 search results - page 80 / 194
COLT
2010
Springer
14 years 12 months ago
Nonparametric Bandits with Covariates
We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...
Philippe Rigollet, Assaf Zeevi
JCP
2007
15 years 2 months ago
Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization
We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-best-paths algorithm. We consider an a...
Nicolas Chapados, Yoshua Bengio
GECCO
2008
Springer
15 years 3 months ago
Scaling ant colony optimization with hierarchical reinforcement learning partitioning
This paper merges hierarchical reinforcement learning (HRL) with ant colony optimization (ACO) to produce an HRL ACO algorithm capable of generating solutions for large domains. Th...
Erik J. Dries, Gilbert L. Peterson
ECAI
1994
Springer
15 years 6 months ago
Reusing Proofs
We develop a learning component for a theorem prover designed for verifying statements by mathematical induction. If the prover has found a proof, it is analyzed yielding a so-ca...
Thomas Kolbe, Christoph Walther
HAIS
2008
Springer
15 years 3 months ago
An Evolutionary Approach for Tuning Artificial Neural Network Parameters
The widespread use of artificial neural networks and the difficult work regarding the correct specification (tuning) of parameters for a given problem are the main aspects that mot...
Leandro M. Almeida, Teresa Bernarda Ludermir