Sciweavers

908 search results - page 148 / 182
» Stochastic Finite Learning
Sort
View
CDC
2010
IEEE
160views Control Systems» more  CDC 2010»
13 years 4 months ago
Adaptive bases for Q-learning
Abstract-- We consider reinforcement learning, and in particular, the Q-learning algorithm in large state and action spaces. In order to cope with the size of the spaces, a functio...
Dotan Di Castro, Shie Mannor
JMLR
2010
111views more  JMLR 2010»
13 years 3 months ago
An EM Algorithm on BDDs with Order Encoding for Logic-based Probabilistic Models
Logic-based probabilistic models (LBPMs) enable us to handle problems with uncertainty succinctly thanks to the expressive power of logic. However, most of LBPMs have restrictions...
Masakazu Ishihata, Yoshitaka Kameya, Taisuke Sato,...
TSP
2010
13 years 3 months ago
Testing stationarity with surrogates: a time-frequency approach
An operational framework is developed for testing stationarity relatively to an observation scale, in both stochastic and deterministic contexts. The proposed method is based on a ...
Pierre Borgnat, Patrick Flandrin, Paul Honeine, C&...
ICASSP
2011
IEEE
13 years 24 days ago
Factor graph-based structural equilibria in dynamical games
Correlated equilibria are a generalization of Nash equilibria that permit agents to act in a correlated manner and can therefore, model learning in games. In this paper we define...
Liming Wang, Vikram Krishnamurthy, Dan Schonfeld
ATAL
2005
Springer
14 years 2 months ago
Rapid on-line temporal sequence prediction by an adaptive agent
Robust sequence prediction is an essential component of an intelligent agent acting in a dynamic world. We consider the case of near-future event prediction by an online learning ...
Steven Jensen, Daniel Boley, Maria L. Gini, Paul R...