Sciweavers

1235 search results - page 163 / 247
» Reinforcement learning in a nutshell
Sort
View
IJCNN
2008
IEEE
14 years 4 months ago
Learning to select relevant perspective in a dynamic environment
— When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...
Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...
AIIDE
2008
14 years 14 days ago
Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games
We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...
Maria Cutumisu, Duane Szafron, Michael H. Bowling,...
ICML
2000
IEEE
14 years 11 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
ECML
2004
Springer
14 years 3 months ago
Experiments in Value Function Approximation with Sparse Support Vector Regression
Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...
Tobias Jung, Thomas Uthmann
CSREAEEE
2008
199views Business» more  CSREAEEE 2008»
13 years 11 months ago
Progranimate - A Web Enabled Algorithmic Problem Solving Application
- This paper proposes the use of an interactive web based problem solving application that utilises flowchart based programming and code generation to address the issues faced by n...
Andrew Scott, Mike Watkins, Duncan McPhee