Search Sciweavers | Sciweavers

» Progressive Optimization in Action

IWANN
1999
Springer

Neural Networks

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 9 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

AI
2008
Springer

Artificial Intelligence

An approach to efficient planning with numerical fluents and multi-criteria plan quality

15 years 4 months ago

Download www.informatik.uni-freiburg.de

Dealing with numerical information is practically important in many real-world planning domains where the executability of an action can depend on certain numerical conditions, an...

Alfonso Gerevini, Alessandro Saetti, Ivan Serina

ICML
2005
IEEE

Machine Learning

Finite time bounds for sampling based fitted value iteration

16 years 5 months ago

Download www.machinelearning.org

In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...

Csaba Szepesvári, Rémi Munos

ICANNGA
2007
Springer

Algorithms

Reinforcement Learning in Fine Time Discretization

15 years 11 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

ICRA
2005
IEEE

Robotics

Multi-Step Look-Ahead Trajectory Planning in SLAM: Possibility and Necessity

15 years 10 months ago

Download services.eng.uts.edu.au

In this paper, the possibility and necessity of multistep trajectory planning in Extended Kalman Filter (EKF) based SLAM is investigated. The objective of the trajector...

Shoudong Huang, Ngai Ming Kwok, Gamini Dissanayake...
