Sciweavers

231 search results - page 12 / 47
» Sensitivity of trust-region algorithms to their parameters
Sort
View
NECO
2010
97views more  NECO 2010»
13 years 5 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
FTDCS
1997
IEEE
13 years 11 months ago
A Scheduling Algorithm for Aperiodic Groups of Tasks in Distributed Real-Time Systems and its Holistic Analysis
This paper deals with the problem of scheduling aperiodic groups of tasks in distributed systems. It proposes two contributions, namely: i) a distributed scheduling algorithm to b...
Paolo Bizzarri, Andrea Bondavalli, Felicita Di Gia...
NAACL
2003
13 years 8 months ago
Weakly Supervised Natural Language Learning Without Redundant Views
We investigate single-view algorithms as an alternative to multi-view algorithms for weakly supervised learning for natural language processing tasks without a natural feature spl...
Vincent Ng, Claire Cardie
MICCAI
2003
Springer
14 years 8 months ago
An Artificially Evolved Vision System for Segmenting Skin Lesion Images
Abstract. We present a novel technique where a medical image segmentation system is evolved using genetic programming. The evolved system was trained on just 8 images outlined by a...
Mark E. Roberts, Ela Claridge
WADS
2007
Springer
115views Algorithms» more  WADS 2007»
14 years 1 months ago
Alpha-Beta Witness Complexes
Building on the work of Martinetz, Schulten and de Silva, Carlsson, we introduce a 2-parameter family of witness complexes and algorithms for constructing them. This family can be ...
Dominique Attali, Herbert Edelsbrunner, John Harer...