Search Sciweavers | Sciweavers

1305 search results - page 97 / 261

» An Optimization Algorithm Based on Active and Instance-Based...

207

Voted

PPSN
2004
Springer

153views Distributed And Parallel Com...» more PPSN 2004»

The Application of Bayesian Optimization and Classifier Systems in Nurse Scheduling

16 years 14 days ago

Download ima.ac.uk

Two ideas taken from Bayesian optimization and classifier systems are presented for personnel scheduling based on choosing a suitable scheduling rule from a set for each person’s...

Jingpeng Li, Uwe Aickelin

claim paper

Read More »

182

click to vote

ICML
1998
IEEE

268views Machine Learning» more ICML 1998»

The MAXQ Method for Hierarchical Reinforcement Learning

16 years 8 months ago

Download www.cs.ualberta.ca

This paper presents a new approach to hierarchical reinforcement learning based on the MAXQ decomposition of the value function. The MAXQ decomposition has both a procedural seman...

Thomas G. Dietterich

claim paper

Read More »

263

click to vote

CORR
2012
Springer

216views Education» more CORR 2012»

Fractional Moments on Bandit Problems

14 years 2 months ago

Download www.cse.iitm.ac.in

Reinforcement learning addresses the dilemma between exploration to ﬁnd profitable actions and exploitation to act according to the best observations already made. Bandit proble...

Ananda Narayanan B., Balaraman Ravindran

claim paper

Read More »

190

click to vote

FMSD
2008

110views more FMSD 2008»

Automatic symbolic compositional verification by learning assumptions

15 years 7 months ago

Download www.personal.psu.edu

Abstract Compositional reasoning aims to improve scalability of verification tools by reducing the original verification task into subproblems. The simplification is typically base...

Wonhong Nam, P. Madhusudan, Rajeev Alur

claim paper

Read More »

137

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

16 years 1 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

« Prev « First page 97 / 261 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers