Search Sciweavers | Sciweavers

14 search results - page 2 / 3

» On the Convergence of Reduction-based and Model-based Method...

click to vote

ICML
1995
IEEE

155views Machine Learning» more ICML 1995»

Stable Function Approximation in Dynamic Programming

14 years 6 months ago

Download www.ri.cmu.edu

The success ofreinforcement learninginpractical problems depends on the ability to combine function approximation with temporal di erence methods such as value iteration. Experime...

Geoffrey J. Gordon

claim paper

Read More »

click to vote

IMAMS
2007

245views Mathematics» more IMAMS 2007»

Discrete Surface Ricci Flow: Theory and Applications

13 years 6 months ago

Download www.cs.sunysb.edu

Conformal geometry is in the core of pure mathematics. Conformal structure is more ﬂexible than Riemaniann metric but more rigid than topology. Conformal geometric methods have p...

Miao Jin, Junho Kim, Xianfeng David Gu

claim paper

Read More »

click to vote

CSFW
2010
IEEE

200views Security Privacy» more CSFW 2010»

Impossibility Results for Secret Establishment

13 years 9 months ago

Download people.inf.ethz.ch

—Security protocol design is a creative discipline where the solution space depends on the problem to be solved and the cryptographic operators available. In this paper, we exami...

Benedikt Schmidt, Patrick Schaller, David A. Basin

claim paper

Read More »

click to vote

KDD
2008
ACM

120views Data Mining» more KDD 2008»

Multi-class cost-sensitive boosting with p-norm loss functions

14 years 5 months ago

Download www.research.ibm.com

We propose a family of novel cost-sensitive boosting methods for multi-class classification by applying the theory of gradient boosting to p-norm based cost functionals. We establ...

Aurelie C. Lozano, Naoki Abe

claim paper

Read More »

click to vote

IROS
2008
IEEE

125views Robotics» more IROS 2008»

Dynamic correlation matrix based multi-Q learning for a multi-robot system

13 years 11 months ago

Download www.ece.stevens-tech.edu

—Multi-robot reinforcement learning is a very challenging area due to several issues, such as large state spaces, difficulty in reward assignment, nondeterministic action selecti...

Hongliang Guo, Yan Meng

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers