Search Sciweavers | Sciweavers

829 search results - page 119 / 166

» A time aggregation approach to Markov decision processes

AAAI
2007


Authorial Idioms for Target Distributions in TTD-MDPs

15 years 4 months ago

Download www.cc.gatech.edu

In designing Markov Decision Processes (MDPs), one must define the world, its dynamics, a set of actions, and a reward function. MDPs are often applied in situations where there i...

David L. Roberts, Sooraj Bhat, Kenneth St. Clair, ...

NIPS
2001


Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 3 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

ENTCS
2008


Game-Based Probabilistic Predicate Abstraction in PRISM

15 years 2 months ago

Download qav.comlab.ox.ac.uk

Modelling and verification of systems such as communi...

Mark Kattenbelt, Marta Z. Kwiatkowska, Gethin Norm...

JAIR
2010


An Investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs

15 years 13 days ago

Download www.jair.org

Decentralized planning in uncertain environments is a complex task generally dealt with by using a decision-theoretic approach, mainly through the framework of Decentralized Parti...

Raghav Aras, Alain Dutech

GLOBECOM
2010
IEEE


Need-Based Communication for Smart Grid: When to Inquire Power Price?

14 years 12 months ago

Download iweb.tntech.edu

In a smart grid, a home appliance can adjust its power consumption level according to the real-time power price obtained from communication channels. Most studies on smart grid do not...

Husheng Li, Robert C. Qiu
