Search Sciweavers | Sciweavers

129 search results - page 20 / 26

» Automatic Recovery Using Bounded Partially Observable Markov...

AAAI
2011

136 views, Intelligent Agents

Linear Dynamic Programs for Resource Management

14 years 7 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

CONNECTION
2008

178 views

Spoken language interaction with model uncertainty: an adaptive human-robot interaction system

15 years 7 months ago

Download people.csail.mit.edu

Spoken language is one of the most intuitive forms of interaction between humans and agents. Unfortunately, agents that interact with people using natural language often experienc...

Finale Doshi, Nicholas Roy

CSL
2012
Springer

311 views, Automated Reasoning

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 3 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young

CORR
2012
Springer

229 views, Education

Cops and Invisible Robbers: the Cost of Drunkenness

14 years 3 months ago

Download www.math.ryerson.ca

We examine a version of the Cops and Robber (CR) game in which the robber is invisible, i.e., the cops do not know his location until they capture him. Apparently this game (CiR) h...

Athanasios Kehagias, Dieter Mitsche, Pawel Pralat

ATAL
2009
Springer

198 views, Intelligent Agents

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

16 years 2 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh
