Search Sciweavers | Sciweavers

IJCAI
2001

151 views, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

NIPS
2003

145 views, Information Technology

A Nonlinear Predictive State Representation

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

GRAPHICSINTERFACE
2000

105 views, Computer Graphics

Are We All In the Same "Bloat"?

Download www.cs.ubc.ca

"Bloat", a term that has existed in the technical community for many years, has recently received attention in the popular press. The term has a negative connotation imp...

Joanna McGrenere, Gale Moore

IPCO
1998

99 views, Optimization

Non-approximability Results for Scheduling Problems with Minsum Criteria

Download www.win.tue.nl

We provide several non-approximability results for deterministic scheduling problems whose objective is to minimize the total job completion time. Unless P = NP, none of the probl...

Han Hoogeveen, Petra Schuurman, Gerhard J. Woeging...

IJCAI
1989

122 views, Artificial Intelligence

Constrained Heuristic Search

Download agi-conf.org

Cognitive architectures aspire for generality both in terms of problem solving and learning across a range of problems, yet to date few examples of domain independent learning has...

Mark S. Fox, Norman M. Sadeh, Can A. Baykan
