Search Sciweavers | Sciweavers

» How to Dynamically Merge Markov Decision Processes

ICML
1996
IEEE

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 1 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

EOR
2006

Performance prediction of an unmanned airborne vehicle multi-agent system

14 years 9 months ago

Download www.damas.ift.ulaval.ca

Consider unmanned airborne vehicle (UAV) control agents in a dynamic multi-agent system. The agents must have a set of goals such as destination airport and intermediate positions...

Zhaotong Lian, Abhijit Deshmukh

ISCA
2009
IEEE

Thread criticality predictors for dynamic performance, power, and resource management in chip multiprocessors

15 years 4 months ago

Download www.princeton.edu

With the shift towards chip multiprocessors (CMPs), exploiting and managing parallelism has become a central problem in computer systems. Many issues of parallelism management boi...

Abhishek Bhattacharjee, Margaret Martonosi

SAC
2010
ACM

MetaSelf: an architecture and a development method for dependable self-* systems

15 years 4 months ago

Download www.dcs.bbk.ac.uk

This paper proposes a software architecture and a development process for engineering dependable and controllable self-organising (SO) systems. Our approach addresses dependabilit...

Giovanna Di Marzo Serugendo, John S. Fitzgerald, A...

ICML
2007
IEEE

Conditional random fields for multi-agent reinforcement learning

15 years 10 months ago

Download www.machinelearning.org

Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained using a set of obser...

Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...
