Search Sciweavers | Sciweavers

383 search results - page 45 / 77

» Learning Model Complexity in an Online Environment

170

click to vote

ACMICEC
2006
ACM

110views ECommerce» more ACMICEC 2006»

Learning inventory management strategies for commodity supply chains with customer satisfaction

15 years 9 months ago

Download www.site.uottawa.ca

In this paper, we look at a supply chain of commodity goods where customer demand is uncertain and partly based on reputation, and where raw material replenishment is uncertain in...

Jeroen van Luin, Han La Poutré, J. Will M. ...

claim paper

Read More »

183

click to vote

UAI
2008

242views Artificial Intelligence» more UAI 2008»

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 7 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

claim paper

Read More »

148

click to vote

GECCO
2009
Springer

135views Optimization» more GECCO 2009»

Neuroevolutionary reinforcement learning for generalized helicopter control

16 years 2 days ago

Download www.science.uva.nl

Helicopter hovering is an important challenge problem in the ﬁeld of reinforcement learning. This paper considers several neuroevolutionary approaches to discovering robust cont...

Rogier Koppejan, Shimon Whiteson

claim paper

Read More »

174

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 7 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

158

click to vote

HRI
2006
ACM

135views Human Computer Interaction» more HRI 2006»

Teaching robots by moulding behavior and scaffolding the environment

15 years 11 months ago

Download homepages.feis.herts.ac.uk

Programming robots to carry out useful tasks is both a complex and non-trivial exercise. A simple and intuitive method to allow humans to train and shape robot behaviour is clearl...

Joe Saunders, Chrystopher L. Nehaniv, Kerstin Daut...

claim paper

Read More »

« Prev « First page 45 / 77 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers