Search Sciweavers | Sciweavers

1262 search results - page 168 / 253

» Reinforcement Learning: An Introduction

AIIDE
2008

146 views, Artificial Intelligence

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

13 years 10 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

ICML
2000
IEEE

153 views, Machine Learning

Eligibility Traces for Off-Policy Policy Evaluation

14 years 9 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

ECML
2004
Springer

154 views, Machine Learning

Experiments in Value Function Approximation with Sparse Support Vector Regression

14 years 1 month ago

Download userweb.cs.utexas.edu

Abstract. We present first experiments using Support Vector Regression as function approximator for an on-line, sarsa-like reinforcement learner. To overcome the batch nature of S...

Tobias Jung, Thomas Uthmann

CSREAEEE
2008

199 views, Business

Progranimate - A Web Enabled Algorithmic Problem Solving Application

13 years 9 months ago

Download www.comp.glam.ac.uk

This paper proposes the use of an interactive web-based problem solving application that utilises flowchart-based programming and code generation to address the issues faced by n...

Andrew Scott, Mike Watkins, Duncan McPhee

ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

14 years 1 month ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony
