Search Sciweavers | Sciweavers

212 search results - page 15 / 43

GECCO
2006
Springer

208 views | Optimization

Comparing evolutionary and temporal difference methods in a reinforcement learning domain

13 years 11 months ago

Download www.cs.bham.ac.uk

Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

HICSS
2003
IEEE

116 views | Biometrics

Modeling Instrumental Conditioning - The Behavioral Regulation Approach

14 years 20 days ago

Download www.hicss.hawaii.edu

Basically, instrumental conditioning is learning through consequences: Behavior that produces positive results (high “instrumental response”) is reinforced, and that which pro...

Jose J. Gonzalez, Agata Sawicka

ICML
2010
IEEE

189 views | Machine Learning

Nonparametric Return Distribution Approximation for Reinforcement Learning

13 years 8 months ago

Download www.icml2010.org

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such ...

Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashim...

DSMML
2004
Springer

190 views | Machine Learning

Understanding Gaussian Process Regression Using the Equivalent Kernel

14 years 23 days ago

Download www.mth.kcl.ac.uk

The equivalent kernel [1] is a way of understanding how Gaussian process regression works for large sample sizes based on a continuum limit. In this paper we show how to approximat...

Peter Sollich, Christopher K. I. Williams

ATAL
2007
Springer

151 views | Intelligent Agents

Batch reinforcement learning in a complex domain

14 years 1 months ago

Download userweb.cs.utexas.edu

Temporal difference reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone
