Search Sciweavers | Sciweavers

1138 search results - page 127 / 228

» Feature Markov Decision Processes

105

click to vote

AIPS
2008

151views Artificial Intelligence» more AIPS 2008»

Criticality Metrics for Distributed Plan and Schedule Management

15 years 3 months ago

Download www.aaai.org

We address the problem of coordinating the plans and schedules for a team of agents in an uncertain and dynamic environment. Bounded rationality, bounded communication, subjectivi...

Rajiv T. Maheswaran, Pedro A. Szekely

claim paper

Read More »

100

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

15 years 3 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

130

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

15 years 2 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

EDM
2010

165views Data Mining» more EDM 2010»

Using a Bayesian Knowledge Base for Hint Selection on Domain Specific Problems

15 years 2 months ago

Download educationaldatamining.org

A Bayesian Knowledge Base is a generalization of traditional Bayesian Networks where nodes or groups of nodes have independence. In this paper we describe a method of generating a ...

John C. Stamper, Tiffany Barnes, Marvin J. Croy

claim paper

Read More »

click to vote

IJCAI
2007

147views Artificial Intelligence» more IJCAI 2007»

The Value of Observation for Monitoring Dynamic Systems

15 years 2 months ago

Download ijcai.org

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

« Prev « First page 127 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers