Search Sciweavers | Sciweavers

4544 search results - page 302 / 909

» Reinforcement Learning with Time

135

click to vote

IICS
2009
Springer

212views Internet Technology» more IICS 2009»

Bi-directional Distribution of eLearning Content for Cross-technology Learning Communities

15 years 8 months ago

Download subs.emis.de

: This article describes the use of a service-oriented architecture to bridge the gap between different eLearning types and tools. The basic concept is a bi-directional distributio...

Raphael Zender, Enrico Dressler, Ulrike Lucke, Dja...

claim paper

Read More »

134

click to vote

JKM
2006

135views more JKM 2006»

Learning from the Mars Rover Mission: scientific discovery, learning and memory

15 years 3 months ago

Download ti.arc.nasa.gov

Purpose Knowledge management for space exploration is part of a multi-generational effort. Each mission builds on knowledge from prior missions, and learning is the first step in ...

Charlotte Linde

claim paper

Read More »

130

Voted

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 6 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

139

click to vote

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 5 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

141

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 5 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

« Prev « First page 302 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers