Search Sciweavers | Sciweavers

78

AAAI
2006

105views Intelligent Agents» more AAAI 2006»

An Asymptotically Optimal Algorithm for the Max k-Armed Bandit Problem

15 years 1 months ago

We present an asymptotically optimal algorithm for the max variant of the k-armed bandit problem. Given a set of k slot machines, each yielding payoff from a fixed (but unknown) d...

Matthew J. Streeter, Stephen F. Smith

claim paper

Read More »

116

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

109

click to vote

AAAI
2000

135views Intelligent Agents» more AAAI 2000»

A Consistency-Based Model for Belief Change: Preliminary Report

15 years 1 months ago

Download www.aaai.org

We present a general, consistency-based framework for belief change. Informally, in revising K by , we begin with and incorporate as much of K as consistently possible. Formally, ...

James P. Delgrande, Torsten Schaub

claim paper

Read More »

84

click to vote

AAAI
1998

150views Intelligent Agents» more AAAI 1998»

Bayesian Network Models for Generation of Crisis Management Training Scenarios

15 years 1 months ago

Download www.aaai.org

We present a noisy-OR Bayesian network model for simulation-based training, and an efficient search-based algorithm for automatic synthesis of plausible training scenarios from co...

Eugene Grois, William H. Hsu, Mikhail Voloshin, Da...

claim paper

Read More »

107

click to vote

AAAI
1998

132views Intelligent Agents» more AAAI 1998»

Learning to Classify Text from Labeled and Unlabeled Documents

15 years 1 months ago

Download www.kamalnigam.com

In many important text classification problems, acquiring class labels for training documents is costly, while gathering large quantities of unlabeled data is cheap. This paper sh...

Kamal Nigam, Andrew McCallum, Sebastian Thrun, Tom...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers