Search Sciweavers | Sciweavers

17 search results - page 3 / 4

» APRICODD: Approximate Policy Construction Using Decision Dia...

106

Voted

IJCAI
2007

182views Artificial Intelligence» more IJCAI 2007»

A Fast Analytical Algorithm for Solving Markov Decision Processes with Real-Valued Resources

15 years 2 months ago

Download teamcore.usc.edu

Agents often have to construct plans that obey deadlines or, more generally, resource limits for real-valued resources whose consumption can only be characterized by probability d...

Janusz Marecki, Sven Koenig, Milind Tambe

claim paper

Read More »

click to vote

CDC
2008
IEEE

120views Control Systems» more CDC 2008»

Approximate abstractions of discrete-time controlled stochastic hybrid systems

15 years 7 months ago

Download hybrid.stanford.edu

ate Abstractions of Discrete-Time Controlled Stochastic Hybrid Systems Alessandro D’Innocenzo, Alessandro Abate, and Maria D. Di Benedetto — This work proposes a procedure to c...

Alessandro D'Innocenzo, Alessandro Abate, Maria Do...

claim paper

Read More »

Voted

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

15 years 7 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

135

Voted

WIOPT
2011
IEEE

253views Computer Networks» more WIOPT 2011»

Network utility maximization over partially observable Markovian channels

14 years 4 months ago

Download www-scf.usc.edu

Abstract—This paper considers maximizing throughput utility in a multi-user network with partially observable Markov ON/OFF channels. Instantaneous channel states are never known...

Chih-Ping Li, Michael J. Neely

claim paper

Read More »

129

click to vote

RSS
2007

176views Robotics» more RSS 2007»

Active Policy Learning for Robot Planning and Exploration under Uncertainty

15 years 2 months ago

Download www.roboticsproceedings.org

Abstract— This paper proposes a simulation-based active policy learning algorithm for ﬁnite-horizon, partially-observed sequential decision processes. The algorithm is tested i...

Ruben Martinez-Cantin, Nando de Freitas, Arnaud Do...

claim paper

Read More »

« Prev « First page 3 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers