Search Sciweavers | Sciweavers

» Relational Reinforcement Learning

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 10 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 5 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

ICML
2003
IEEE

108views Machine Learning» more ICML 2003»

Avoiding Bias when Aggregating Relational Data with Degree Disparity

16 years 4 months ago

Download kdl.cs.umass.edu

David Jensen, Jennifer Neville, Michael Hay

UAI
2008

236views Artificial Intelligence» more UAI 2008»

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

15 years 5 months ago

Download uai2008.cs.helsinki.fi

Continuous state spaces and stochastic, switching dynamics characterize a number of rich, real-world domains, such as robot navigation across varying terrain. We describe a reinfor...

Emma Brunskill, Bethany R. Leffler, Lihong Li, Mic...

ATAL
2009
Springer

172views Intelligent Agents» more ATAL 2009»

Decentralized Learning in Wireless Sensor Networks

15 years 1 month ago

Download teamcore.usc.edu

In this paper we use a reinforcement learning algorithm with the aim to increase the autonomous lifetime of a Wireless Sensor Network (WSN) and decrease latency in a decentralized...

Mihail Mihaylov, Karl Tuyls, Ann Nowé
