Search Sciweavers | Sciweavers

35 search results - page 7 / 7

» Balancing Multiple Sources of Reward in Reinforcement Learni...

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 6 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

KDD
2009
ACM

227views Data Mining» more KDD 2009»

Efficiently learning the accuracy of labeling sources for selective sampling

14 years 5 months ago

Download www.cs.cmu.edu

Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...

Pinar Donmez, Jaime G. Carbonell, Jeff Schneider

claim paper

Read More »

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

13 years 3 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

click to vote

GECCO
2005
Springer

153views Optimization» more GECCO 2005»

Evolving neural network ensembles for control problems

13 years 10 months ago

Download userweb.cs.utexas.edu

In neuroevolution, a genetic algorithm is used to evolve a neural network to perform a particular task. The standard approach is to evolve a population over a number of generation...

David Pardoe, Michael S. Ryoo, Risto Miikkulainen

claim paper

Read More »

click to vote

TIFS
2008

154views more TIFS 2008»

Data Fusion and Cost Minimization for Intrusion Detection

13 years 5 months ago

Download ttic.uchicago.edu

Abstract--Statistical pattern recognition techniques have recently been shown to provide a finer balance between misdetections and false alarms than the more conventional intrusion...

Devi Parikh, Tsuhan Chen

claim paper

Read More »

« Prev « First page 7 / 7 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers