Search Sciweavers | Sciweavers

24 search results - page 4 / 5

» Learning Policy Improvements with Path Integrals


STOC
2006
ACM

122 views, Algorithms

Fast convergence to Wardrop equilibria by adaptive sampling methods

16 years 13 days ago

Download www-i1.informatik.rwth-aachen.de

We study rerouting policies in a dynamic round-based variant of a well-known game-theoretic traffic model due to Wardrop. Previous analyses (mostly in the context of selfish routi...

Simon Fischer, Harald Räcke, Berthold Vö...


ICML
2002
IEEE

138 views, Machine Learning

Reinforcement Learning and Shaping: Encouraging Intended Behaviors

16 years 28 days ago

Download www.grappa.univ-lille3.fr

We explore dynamic shaping to integrate our prior beliefs of the final policy into a conventional reinforcement learning system. Shaping provides a positive or negative artificial...

Adam Laud, Gerald DeJong


FLAIRS
2006

149 views, Artificial Intelligence

Simulated Visual Perception-Based Control for Autonomous Mobile Agents

15 years 1 months ago

Download www.aaai.org

Autonomous robots, such as automatic vacuum cleaners, toy robot dogs, and autonomous vehicles for the military, are rapidly becoming a part of everyday life. As a result, the need ...

Daniel Flower, Burkhard Wünsche, Hans W. Gues...


MONET
2007

132 views

QUORUM - Quality of Service in Wireless Mesh Networks

14 years 11 months ago

Download www.cs.ucsb.edu

Wireless mesh networks (WMNs) can provide seamless broadband connectivity to network users with low setup and maintenance costs. To support next-generation applications wit...

Vinod Kone, Sudipto Das, Ben Y. Zhao, Haitao Zheng


KDD
2008
ACM

193 views, Data Mining

A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances

16 years 16 days ago

Download www.isys.ucl.ac.be

This work introduces a new family of link-based dissimilarity measures between nodes of a weighted directed graph. This measure, called the randomized shortest-path (RSP) dissimil...

Luh Yen, Marco Saerens, Amin Mantrach, Masashi Shi...
