Search Sciweavers | Sciweavers

88

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

14 years 11 months ago

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

75

click to vote

MOR
2006

79views more MOR 2006»

The Value of Markov Chain Games with Lack of Information on One Side

14 years 9 months ago

Download www.ceremade.dauphine.fr

We consider a two-player zero-sum game given by a Markov chain over a finite set of states K and a family of zero-sum matrix games (Gk)kK. The sequence of states follows the Marko...

Jérôme Renault

claim paper

Read More »

78

click to vote

DAS
2010
Springer

139views Document Analysis» more DAS 2010»

Information extraction by finding repeated structure

14 years 7 months ago

Download www2.parc.com

Repetition of layout structure is prevalent in document images. In document design, such repetition conveys the underlying logical and functional structure of the data. For exampl...

Evgeniy Bart, Prateek Sarkar

claim paper

Read More »

89

click to vote

AAAI
2010

236views Intelligent Agents» more AAAI 2010»

Efficient Belief Propagation for Utility Maximization and Repeated Inference

14 years 11 months ago

Download www.cs.washington.edu

Many problems require repeated inference on probabilistic graphical models, with different values for evidence variables or other changes. Examples of such problems include utilit...

Aniruddh Nath, Pedro Domingos

claim paper

Read More »

80

click to vote

IJON
2002

79views more IJON 2002»

Capacity of perirhinal cortex network for recognising frequently repeating stimuli

14 years 9 months ago

Download www.cs.bris.ac.uk

Much evidence indicates that discrimination of the familiarity of visual stimuli is dependent on the perirhinal cortex of the temporal lobe. A stimulus can become familiar to anim...

Rafal Bogacz, Malcolm W. Brown

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers