Search Sciweavers | Sciweavers

120 search results - page 19 / 24

» Hierarchical Solution of Markov Decision Processes using Mac...

click to vote

IJCAI
2007

175views Artificial Intelligence» more IJCAI 2007»

An Experts Algorithm for Transfer Learning

14 years 11 months ago

Download www.ijcai.org

A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for it...

Erik Talvitie, Satinder Singh

claim paper

Read More »

100

click to vote

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

15 years 10 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

click to vote

AAAI
2004

167views Intelligent Agents» more AAAI 2004»

Dynamic Programming for Partially Observable Stochastic Games

14 years 11 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...

claim paper

Read More »

click to vote

ICRA
2010
IEEE

143views Robotics» more ICRA 2010»

Apprenticeship learning via soft local homomorphisms

14 years 8 months ago

Download damas.ift.ulaval.ca

Abstract— We consider the problem of apprenticeship learning when the expert’s demonstration covers only a small part of a large state space. Inverse Reinforcement Learning (IR...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

click to vote

ICRA
2010
IEEE

97views Robotics» more ICRA 2010»

Probabilistic motion planning of balloons in strong, uncertain wind fields

14 years 8 months ago

Download web.mit.edu

—This paper introduces a new algorithm for probabilistic motion planning in arbitrary, uncertain vector ﬁelds, with emphasis on high-level planning for Montgolﬁer´e balloons...

Michael T. Wolf, Lars Blackmore, Yoshiaki Kuwata, ...

claim paper

Read More »

« Prev « First page 19 / 24 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers