Search Sciweavers | Sciweavers

2464 search results - page 29 / 493

» Efficient learning equilibrium

189

click to vote

ESANN
2000

95views Neural Networks» more ESANN 2000»

Local input-output stability of recurrent networks with time-varying weights

15 years 8 months ago

Download www.dice.ucl.ac.be

Abstract. We present local conditions for input-output stability of recurrent neural networks with time-varying parameters introduced for instance by noise or on-line adaptation. T...

Jochen J. Steil

claim paper

Read More »

208

click to vote

JMLR
2011

188views more JMLR 2011»

Linking Granger Causality and the Pearl Causal Model with Settable Systems

15 years 2 months ago

Download fmwww.bc.edu

The causal notions embodied in the concept of Granger causality have been argued to belong to a diﬀerent category than those of Judea Pearl’s Causal Model, and so far their re...

Halbert White, Karim Chalak, Xun Lu

claim paper

Read More »

165

click to vote

ICMLA
2004

83views Machine Learning» more ICMLA 2004»

Satisficing Q-learning: efficient learning in problems with dichotomous attributes

15 years 8 months ago

Download faculty.cs.byu.edu

In some environments, a learning agent must learn to balance competing objectives. For example, a Q-learner agent may need to learn which choices expose the agent to risk and whic...

Michael A. Goodrich, Morgan Quigley

claim paper

Read More »

168

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

15 years 8 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

176

click to vote

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 7 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 29 / 493 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers