Search Sciweavers | Sciweavers

948 search results - page 129 / 190

» Modelling Agents as Observable Sources

154

click to vote

IJCAI
2001

151views Artificial Intelligence» more IJCAI 2001»

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

claim paper

Read More »

191

click to vote

AAAI
1993

189views Intelligent Agents» more AAAI 1993»

A Method for Development of Dialogue Managers for Natural Language Interfaces

15 years 5 months ago

Download www.ida.liu.se

This paper describes a method for the development of dialogue managers for natural language interfaces. A dialogue manager is presented designed on the basis of both a theoretical...

Arne Jönsson

claim paper

Read More »

143

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 5 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

181

click to vote

ICAI
2009

143views Artificial Intelligence» more ICAI 2009»

Expectancy-Based Robot Localization Through Context Evaluation

15 years 1 months ago

Download www.ai.rug.nl

Agents that operate in a real-world environment have to process an abundance of information, which may be ambiguous or noisy. We present a method inspired by cognitive research tha...

Maria E. Niessen, Gert Kootstra, Sjoerd de Jong, T...

claim paper

Read More »

142

click to vote

ATAL
2005
Springer

103views Intelligent Agents» more ATAL 2005»

Reasoning about joint beliefs for execution-time communication decisions

15 years 9 months ago

Download www.cs.huji.ac.il

Just as POMDPs have been used to reason explicitly about uncertainty in single-agent systems, there has been recent interest in using multi-agent POMDPs to coordinate teams of age...

Maayan Roth, Reid G. Simmons, Manuela M. Veloso

claim paper

Read More »

« Prev « First page 129 / 190 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers