Search Sciweavers | Sciweavers

14

NIPS
2008

165views Information Technology» more NIPS 2008»

13 years 5 months ago

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

10

click to vote

AUTOMATICA
2005

108views more AUTOMATICA 2005»

Robust optimal control of regular languages

13 years 4 months ago

Download wimpy1.psu.edu

This paper presents an algorithm for robust optimal control of regular languages under specified uncertainty bounds on the event cost parameters of the language measure that has b...

Constantino M. Lagoa, Jinbo Fu, Asok Ray

claim paper

Read More »

9

click to vote

CORR
2007
Springer

94views Education» more CORR 2007»

Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm

13 years 4 months ago

Download www.ieee-infocom.org

— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...

Bruce Hajek, Kevin Mitzel, Sichao Yang

claim paper

Read More »

12

click to vote

NA
2007

105views more NA 2007»

Orthogonal projection regularization operators

13 years 4 months ago

Download www.math.kent.edu

Abstract. Tikhonov regularization often is applied with a ﬁnite diﬀerence regularization operator that approximates a low-order derivative. This paper proposes the use of ortho...

Serena Morigi, Lothar Reichel, Fiorella Sgallari

claim paper

Read More »

14

click to vote

AAAI
2007

126views Intelligent Agents» more AAAI 2007»

Point-Based Policy Iteration

13 years 6 months ago

Download www.cs.duke.edu

We describe a point-based policy iteration (PBPI) algorithm for inﬁnite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...

Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers