Search Sciweavers | Sciweavers




GECCO
2006
Springer


Optimising cancer chemotherapy using an estimation of distribution algorithm and genetic algorithms

15 years 6 months ago

Download www.comp.rgu.ac.uk

This paper presents a methodology for using heuristic search methods to optimise cancer chemotherapy. Specifically, two evolutionary algorithms - Population Based Incremental Lear...

Andrei Petrovski, Siddhartha Shakya, John A. W. Mc...


JMLR
2010


Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...


ATAL
2006
Springer


Efficient agent-based models for non-genomic evolution

15 years 6 months ago

Download web.engr.oregonstate.edu

Modeling dynamical systems composed of aggregations of primitive proteins is critical to the field of astrobiological science, which studies early evolutionary structures dealing ...

Nachi Gupta, Adrian K. Agogino, Kagan Tumer


ECAI
2006
Springer


Least Squares SVM for Least Squares TD Learning

15 years 6 months ago

Download homepages.feis.herts.ac.uk

We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani


CDC
2010
IEEE


Implicit learning for explicit discount targeting in Online Social networks

14 years 10 months ago

Download www.ece.tamu.edu

Online Social networks are increasingly being seen as a means of obtaining awareness of user preferences. Such awareness could be used to target goods and services at them. We cons...

Srinivas Shakkottai, Lei Ying, Sankalp Sah
