Sciweavers

231 search results - page 12 / 47
» Sensitivity of trust-region algorithms to their parameters
Sort
View
NECO
2010
97views more  NECO 2010»
14 years 10 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
FTDCS
1997
IEEE
15 years 3 months ago
A Scheduling Algorithm for Aperiodic Groups of Tasks in Distributed Real-Time Systems and its Holistic Analysis
This paper deals with the problem of scheduling aperiodic groups of tasks in distributed systems. It proposes two contributions, namely: i) a distributed scheduling algorithm to b...
Paolo Bizzarri, Andrea Bondavalli, Felicita Di Gia...
NAACL
2003
15 years 1 months ago
Weakly Supervised Natural Language Learning Without Redundant Views
We investigate single-view algorithms as an alternative to multi-view algorithms for weakly supervised learning for natural language processing tasks without a natural feature spl...
Vincent Ng, Claire Cardie
90
Voted
MICCAI
2003
Springer
16 years 18 days ago
An Artificially Evolved Vision System for Segmenting Skin Lesion Images
Abstract. We present a novel technique where a medical image segmentation system is evolved using genetic programming. The evolved system was trained on just 8 images outlined by a...
Mark E. Roberts, Ela Claridge
WADS
2007
Springer
115views Algorithms» more  WADS 2007»
15 years 5 months ago
Alpha-Beta Witness Complexes
Building on the work of Martinetz, Schulten and de Silva, Carlsson, we introduce a 2-parameter family of witness complexes and algorithms for constructing them. This family can be ...
Dominique Attali, Herbert Edelsbrunner, John Harer...