Search Sciweavers | Sciweavers

908 search results - page 148 / 182

» Stochastic Finite Learning

CDC
2010
IEEE

160 views, Control Systems

Adaptive bases for Q-learning

15 years 2 months ago

Download webee.technion.ac.il

We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...

Dotan Di Castro, Shie Mannor

JMLR
2010

111 views

An EM Algorithm on BDDs with Order Encoding for Logic-based Probabilistic Models

15 years 2 months ago

Download jmlr.csail.mit.edu

Logic-based probabilistic models (LBPMs) enable us to handle problems with uncertainty succinctly thanks to the expressive power of logic. However, most of LBPMs have restrictions...

Masakazu Ishihata, Yoshitaka Kameya, Taisuke Sato,...

TSP
2010

110 views, Artificial Intelligence

Testing stationarity with surrogates: a time-frequency approach

15 years 2 months ago

Download hal-ens-lyon.archives-ouvertes.fr

An operational framework is developed for testing stationarity relatively to an observation scale, in both stochastic and deterministic contexts. The proposed method is based on a ...

Pierre Borgnat, Patrick Flandrin, Paul Honeine, C&...

ICASSP
2011
IEEE

177 views, Signal Processing

Factor graph-based structural equilibria in dynamical games

14 years 11 months ago

Download mirlab.org

Correlated equilibria are a generalization of Nash equilibria that permit agents to act in a correlated manner and can therefore model learning in games. In this paper we define...

Liming Wang, Vikram Krishnamurthy, Dan Schonfeld

ATAL
2005
Springer

124 views, Intelligent Agents

Rapid on-line temporal sequence prediction by an adaptive agent

16 years 25 days ago

Download gandalf.psych.umn.edu

Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning ...

Steven Jensen, Daniel Boley, Maria L. Gini, Paul R...
