Search Sciweavers | Sciweavers

25 search results - page 4 / 5

» A set-membership state estimation algorithm based on DC prog...

click to vote

ICC
2007
IEEE

185views Communications» more ICC 2007»

Maximal Lifetime Rate and Power Allocation for Sensor Networks with Data Distortion Constraints

13 years 11 months ago

Download www.prism.uvsq.fr

— We address a lifetime maximization problem for a single-hop wireless sensor network where multiple sensors encode and communicate their measurements of a Gaussian random source...

James C. F. Li, Subhrakanti Dey, Jamie S. Evans

claim paper

Read More »

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 6 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

click to vote

SPIN
2004
Springer

316views Theoretical Computer Science» more SPIN 2004»

Directed Error Detection in C++ with the Assembly-Level Model Checker StEAM

13 years 10 months ago

Download spinroot.com

Most approaches for model checking software are based on ration of abstract models from source code, which may greatly reduce the search space, but may also introduce errors that a...

Peter Leven, Tilman Mehler, Stefan Edelkamp

claim paper

Read More »

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

13 years 11 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

ICML
2008
IEEE

117views Machine Learning» more ICML 2008»

Sample-based learning and search with permanent and transient memories

14 years 5 months ago

Download www.cs.ualberta.ca

We present a reinforcement learning architecture, Dyna-2, that encompasses both samplebased learning and sample-based search, and that generalises across states during both learni...

David Silver, Martin Müller 0003, Richard S. ...

claim paper

Read More »

« Prev « First page 4 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers