Search Sciweavers | Sciweavers

463 search results - page 27 / 93

» Localizing Search in Reinforcement Learning

ATAL
2009
Springer

Intelligent Agents

Online exploration in least-squares policy iteration

15 years 10 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

ESANN
2003

Neural Networks

Improving iterative repair strategies for scheduling with the SVM

15 years 4 months ago

Download www2.in.tu-clausthal.de

The resource-constrained project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling that takes into account the limited availability of resources in ...

Kai Gersmann, Barbara Hammer

ICML
2003
IEEE

Machine Learning

Stochastic Local Search in k-Term DNF Learning

16 years 4 months ago

Download www.hpl.hp.com

A novel native stochastic local search algorithm for solving k-term DNF problems is presented. It is evaluated on hard k-term DNF problems that lie on the phase transition and com...

Stefan Kramer, Ulrich Rückert

ECAI
2008
Springer

Artificial Intelligence

Structure Learning of Markov Logic Networks through Iterated Local Search

15 years 5 months ago

Download www.di.uniba.it

Many real-world applications of AI require both probability and first-order logic to deal with uncertainty and structural complexity. Logical AI has focused mainly on handling com...

Marenglen Biba, Stefano Ferilli, Floriana Esposito

ICRA
2010
IEEE

Robotics

Apprenticeship learning via soft local homomorphisms

15 years 1 month ago

Download damas.ift.ulaval.ca

We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa
