Search Sciweavers | Sciweavers

326 search results - page 61 / 66

» Reinforcement Learning Based on On-Line EM Algorithm


WOWMOM
2005
ACM

240 views, Multimedia

An Adaptive Routing Protocol for Ad Hoc Peer-to-Peer Networks

16 years 6 days ago

Download sixearch.org

Ad hoc networks represent a key factor in the evolution of wireless communications. These networks typically consist of equal nodes that communicate without central control, inter...

Luca Gatani, Giuseppe Lo Re, Salvatore Gaglio



NN
2010
Springer

125 views, Neural Networks

Parameter-exploring policy gradients

15 years 5 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...



SASO
2008
IEEE

159 views, Control Systems

Bottom-Up Self-Organization of Unpredictable Demand and Supply under Decentralized Power Management

16 years 1 months ago

Download ls3-www.cs.uni-dortmund.de

In the DEZENT project we had established a distributed base model for negotiating electric power from widely distributed (renewable) power sources on multiple levels in successio...

Horst F. Wedde, Sebastian Lehnhoff, Christian Reht...



SDM
2008
SIAM

177 views, Data Mining

Practical Private Computation and Zero-Knowledge Tools for Privacy-Preserving Distributed Data Mining

15 years 8 months ago

Download www.cs.berkeley.edu

In this paper we explore private computation built on vector addition and its applications in privacy-preserving data mining. Vector addition is a surprisingly general tool for imp...

Yitao Duan, John F. Canny



SDM
2004
SIAM

212 views, Data Mining

Clustering with Bregman Divergences

15 years 8 months ago

Download jmlr.csail.mit.edu

A wide variety of distortion functions, such as squared Euclidean distance, Mahalanobis distance, Itakura-Saito distance and relative entropy, have been used for clustering. In th...

Arindam Banerjee, Srujana Merugu, Inderjit S. Dhil...

