Search Sciweavers | Sciweavers

104

RTAS
2006
IEEE

135views Embedded Systems» more RTAS 2006»

Scalable Modeling and Performance Evaluation of Wireless Sensor Networks

15 years 5 months ago

A notable features of many proposed Wireless Sensor Networks (WSNs) deployments is their scale: hundreds to thousands of nodes linked together. In such systems, modeling the state...

YoungMin Kwon, Gul Agha

claim paper

Read More »

97

click to vote

ATVA
2010
Springer

284views Hardware» more ATVA 2010»

YAGA: Automated Analysis of Quantitative Safety Specifications in Probabilistic B

15 years 18 days ago

Download web.science.mq.edu.au

Probabilistic B (pB) [2, 8] extends classical B [7] to incorporate probabilistic updates together with the specification of quantitative safety properties. As for classical B, prob...

Ukachukwu Ndukwu, A. K. McIver

claim paper

Read More »

138

click to vote

QEST
2010
IEEE

139views Modeling and Simulation» more QEST 2010»

Reasoning about MDPs as Transformers of Probability Distributions

14 years 9 months ago

Download osl.cs.uiuc.edu

We consider Markov Decision Processes (MDPs) as transformers on probability distributions, where with respect to a scheduler that resolves nondeterminism, the MDP can be seen as ex...

Vijay Anand Korthikanti, Mahesh Viswanathan, Gul A...

claim paper

Read More »

92

click to vote

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

14 years 11 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

93

click to vote

CORR
2006
Springer

113views Education» more CORR 2006»

A Unified View of TD Algorithms; Introducing Full-Gradient TD and Equi-Gradient Descent TD

14 years 11 months ago

Download hal.inria.fr

This paper addresses the issue of policy evaluation in Markov Decision Processes, using linear function approximation. It provides a unified view of algorithms such as TD(), LSTD()...

Manuel Loth, Philippe Preux

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers