Search Sciweavers | Sciweavers

» Learning for Dynamic Subsumption

CIG
2005
IEEE

162 views, Applied Computing

Nannon: A Nano Backgammon for Machine Learning Research

16 years 22 days ago

Download cswww.essex.ac.uk

A newly designed game is introduced, which feels like Backgammon, but has a simplified rule set. Unlike earlier attempts at simplifying the game, Nannon maintains enough features a...

Jordan B. Pollack

ICDM
2005
IEEE

163 views, Data Mining

Balancing Exploration and Exploitation: A New Algorithm for Active Machine Learning

16 years 22 days ago

Download cse.unl.edu

Active machine learning algorithms are used when large numbers of unlabeled examples are available and getting labels for them is costly (e.g. requiring consulting a human expert)...

Thomas Takeo Osugi, Kun Deng, Stephen D. Scott

ICRA
2002
IEEE

133 views, Robotics

The Necessity of Average Rewards in Cooperative Multirobot Learning

16 years 1 day ago

Download www.ri.cmu.edu

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular single-robot learning algorithms based on discou...

Poj Tangamchit, John M. Dolan, Pradeep K. Khosla

ICML
2000
IEEE

165 views, Machine Learning

A Bayesian Framework for Reinforcement Learning

15 years 11 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

WSC
2007

166 views, Modeling And Simulation

Optimizing time warp simulation with reinforcement learning techniques

15 years 9 months ago

Download www.informs-sim.org

Adaptive Time Warp protocols in the literature are usually based on a pre-defined analytic model of the system, expressed as a closed form function that maps system state to cont...

Jun Wang, Carl Tropper
