Sciweavers

1310 search results - page 141 / 262
» Progressive Optimization in Action
Sort
View
IWANN
1999
Springer
15 years 9 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson
AI
2008
Springer
15 years 4 months ago
An approach to efficient planning with numerical fluents and multi-criteria plan quality
Dealing with numerical information is practically important in many real-world planning domains where the executability of an action can depend on certain numerical conditions, an...
Alfonso Gerevini, Alessandro Saetti, Ivan Serina
ICML
2005
IEEE
16 years 5 months ago
Finite time bounds for sampling based fitted value iteration
In this paper we consider sampling based fitted value iteration for discounted, large (possibly infinite) state space, finite action Markovian Decision Problems where only a gener...
Csaba Szepesvári, Rémi Munos
ICANNGA
2007
Springer
105views Algorithms» more  ICANNGA 2007»
15 years 11 months ago
Reinforcement Learning in Fine Time Discretization
Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...
Pawel Wawrzynski
ICRA
2005
IEEE
151views Robotics» more  ICRA 2005»
15 years 10 months ago
Multi-Step Look-Ahead Trajectory Planning in SLAM: Possibility and Necessity
Abstract— In this paper, the possibility and necessity of multistep trajectory planning in Extended Kalman Filter (EKF) based SLAM is investigated. The objective of the trajector...
Shoudong Huang, Ngai Ming Kwok, Gamini Dissanayake...