Sciweavers

121 search results - page 15 / 25
» Toward Off-Policy Learning Control with Function Approximati...
Sort
View
NIPS
1996
13 years 10 months ago
Multidimensional Triangulation and Interpolation for Reinforcement Learning
Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...
Scott Davies
ACMSE
2007
ACM
14 years 1 months ago
BehaviorSim: towards an educational tool for behavior-based agent
A major paradigm of modeling the decision making of autonomous agents is through behavior-based network models. The network consists of distributed behaviors that compete (or coop...
Pavel Lakhtanau, Xiaolin Hu, Fasheng Qiu
COLT
2003
Springer
14 years 2 months ago
Learning with Rigorous Support Vector Machines
We examine the so-called rigorous support vector machine (RSVM) approach proposed by Vapnik (1998). The formulation of RSVM is derived by explicitly implementing the structural ris...
Jinbo Bi, Vladimir Vapnik
AAAI
2011
12 years 9 months ago
Combining Learned Discrete and Continuous Action Models
Action modeling is an important skill for agents that must perform tasks in novel domains. Previous work on action modeling has focused on learning STRIPS operators in discrete, r...
Joseph Z. Xu, John E. Laird
PKDD
2010
Springer
179views Data Mining» more  PKDD 2010»
13 years 7 months ago
Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration
Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...
Tobias Jung, Peter Stone