Search Sciweavers | Sciweavers

166 search results - page 14 / 34

» Online model learning in adversarial Markov decision process...

Voted

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 4 days ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

108

click to vote

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

16 years 4 days ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

click to vote

CONNECTION
2008

178views more CONNECTION 2008»

Spoken language interaction with model uncertainty: an adaptive human-robot interaction system

14 years 11 months ago

Download people.csail.mit.edu

Spoken language is one of the most intuitive forms of interaction between humans and agents. Unfortunately, agents that interact with people using natural language often experienc...

Finale Doshi, Nicholas Roy

claim paper

Read More »

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

16 years 4 days ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

CORR
2010
Springer

106views Education» more CORR 2010»

MDPs with Unawareness

14 years 11 months ago

Download www.cs.cornell.edu

Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...

Joseph Y. Halpern, Nan Rong, Ashutosh Saxena

claim paper

Read More »

« Prev « First page 14 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers