Search Sciweavers | Sciweavers

187 search results - page 28 / 38

» Hedging Uncertainty: Approximation Algorithms for Stochastic...

click to vote

ICRA
2010
IEEE

163views Robotics» more ICRA 2010»

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

14 years 10 months ago

Download robotics.ai.uiuc.edu

Abstract— We propose a planning algorithm that allows usersupplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...

claim paper

Read More »

click to vote

CDC
2010
IEEE

139views Control Systems» more CDC 2010»

Q-learning and enhanced policy iteration in discounted dynamic programming

14 years 6 months ago

Download web.mit.edu

We consider the classical finite-state discounted Markovian decision problem, and we introduce a new policy iteration-like algorithm for finding the optimal state costs or Q-facto...

Dimitri P. Bertsekas, Huizhen Yu

claim paper

Read More »

111

click to vote

CDC
2009
IEEE

147views Control Systems» more CDC 2009»

A simulation-based method for aggregating Markov chains

15 years 4 months ago

Download mechse.illinois.edu

— This paper addresses model reduction for a Markov chain on a large state space. A simulation-based framework is introduced to perform state aggregation of the Markov chain base...

Kun Deng, Prashant G. Mehta, Sean P. Meyn

claim paper

Read More »

103

click to vote

ATAL
2008
Springer

184views Intelligent Agents» more ATAL 2008»

Sequential decision making with untrustworthy service providers

15 years 1 months ago

Download www.aamas-conference.org

In this paper, we deal with the sequential decision making problem of agents operating in computational economies, where there is uncertainty regarding the trustworthiness of serv...

W. T. Luke Teacy, Georgios Chalkiadakis, Alex Roge...

claim paper

Read More »

109

click to vote

INFOCOM
2007
IEEE

160views Communications» more INFOCOM 2007»

Optimal Policies for Distributed Data Aggregation in Wireless Sensor Networks

15 years 6 months ago

Download www.ecse.rpi.edu

Abstract— We consider the scenario of distributed data aggregation in wireless sensor networks, where each sensor can obtain and estimate the information of the whole sensing ﬁ...

Zhenzhen Ye, Alhussein A. Abouzeid, Jing Ai

posted by yecloud

Read More »

« Prev « First page 28 / 38 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers