Search Sciweavers | Sciweavers

682 search results - page 95 / 137

» One-Counter Markov Decision Processes

EDM
2010

165 views | Data Mining

Using a Bayesian Knowledge Base for Hint Selection on Domain Specific Problems

15 years 2 months ago

Download educationaldatamining.org

A Bayesian Knowledge Base is a generalization of traditional Bayesian Networks where nodes or groups of nodes have independence. In this paper we describe a method of generating a ...

John C. Stamper, Tiffany Barnes, Marvin J. Croy

IJCAI
2007

147 views | Artificial Intelligence

The Value of Observation for Monitoring Dynamic Systems

15 years 2 months ago

Download ijcai.org

We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

IJCAI
2007

254 views | Artificial Intelligence

Bayesian Inverse Reinforcement Learning

15 years 2 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

IJCAI
2007

175 views | Artificial Intelligence

An Experts Algorithm for Transfer Learning

15 years 2 months ago

Download www.ijcai.org

A long-lived agent continually faces new tasks in its environment. Such an agent may be able to use knowledge learned in solving earlier tasks to produce candidate policies for it...

Erik Talvitie, Satinder Singh

AAAI
2004

167 views | Intelligent Agents

Dynamic Programming for Partially Observable Stochastic Games

15 years 2 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...
