Search Sciweavers | Sciweavers




AGENTS
2001
Springer


Using background knowledge to speed reinforcement learning in physical agents

15 years 9 months ago

Download www.isle.org

This paper describes Icarus, an agent architecture that embeds a hierarchical reinforcement learning algorithm within a language for specifying agent behavior. An Icarus program e...

Daniel G. Shapiro, Pat Langley, Ross D. Shachter


AAAI
2007


Point-Based Policy Iteration

15 years 7 months ago

Download www.cs.duke.edu

We describe a point-based policy iteration (PBPI) algorithm for infinite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...

Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...


COMPGEOM
2005
ACM


1-link shortest paths in weighted regions

15 years 6 months ago

Download www.tiger-marmalade.com

We illustrate the Link Solver software for computing 1-link shortest paths in weighted regions. The Link Solver implements a prune-and-search method that can be used to approximat...

Ovidiu Daescu, James D. Palmer


AAAI
2010


Multi-Agent Learning with Policy Prediction

15 years 6 months ago

Download www.cs.umass.edu

Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting th...

Chongjie Zhang, Victor R. Lesser


DAGSTUHL
2007


An Inner/Outer Stationary Iteration for Computing PageRank

15 years 6 months ago

Download drops.dagstuhl.de

We present a stationary iterative scheme for PageRank computation. The algorithm is based on a linear system formulation of the problem, uses inner/outer iterations, and amounts to...

Andrew P. Gray, Chen Greif, Tracy Lau
