Search Sciweavers | Sciweavers

32

NIPS
2007

80views Information Technology» more NIPS 2007»

14 years 17 days ago

Recently, we have introduced a novel approach to dynamic programming and reinforcement learning that is based on maintaining explicit representations of stationary distributions i...

Tao Wang, Daniel J. Lizotte, Michael H. Bowling, D...

claim paper

Read More »

36

click to vote

AGI
2008

142views Artificial Intelligence» more AGI 2008»

Transfer Learning and Intelligence: an Argument and Approach

14 years 18 days ago

Download www.cs.utexas.edu

In order to claim fully general intelligence in an autonomous agent, the ability to learn is one of the most central capabilities. Classical machine learning techniques have had ma...

Matthew E. Taylor, Gregory Kuhlmann, Peter Stone

claim paper

Read More »

38

click to vote

IJCNN
2008
IEEE

202views Neural Networks» more IJCNN 2008»

Learning to select relevant perspective in a dynamic environment

14 years 5 months ago

Download www.cs.qub.ac.uk

— When an agent observes its environment, there are two important characteristics of the perceived information. One is the relevance of information and the other is redundancy. T...

Zhihui Luo, David A. Bell, Barry McCollum, Qingxia...

claim paper

Read More »

34

click to vote

AIIDE
2008

146views Artificial Intelligence» more AIIDE 2008»

Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games

14 years 1 months ago

Download www.aaai.org

We introduce the ALeRT (Action-dependent Learning Rates with Trends) algorithm that makes two modifications to the learning rate and one change to the exploration rate of traditio...

Maria Cutumisu, Duane Szafron, Michael H. Bowling,...

claim paper

Read More »

50

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

14 years 12 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers