Search Sciweavers | Sciweavers

1512 search results - page 195 / 303

» Qualitative reinforcement learning

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 25 days ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

ICML
1998
IEEE

165 views, Machine Learning

Intra-Option Learning about Temporally Abstract Actions

16 years 6 months ago

Download www.cs.ualberta.ca

Intra-Option Learning about Temporally Abstract Actions. Richard S. Sutton, Department of Computer Science, University of Massachusetts, Amherst, MA 01003-4610, rich@cs.umass.edu; Doina Precup, D...

Richard S. Sutton, Doina Precup, Satinder P. Singh

ATAL
2009
Springer

155 views, Intelligent Agents

MABLE: a framework for learning from natural instruction

16 years 19 days ago

Download www.personal.utulsa.edu

The Modular Architecture for Bootstrapped Learning Experiments (MABLE) is a system that is being developed to allow humans to teach computers in the most natural manner possible: ...

Roger Mailler, Daniel Bryce, Jiaying Shen, Ciaran ...

GECCO
2005
Springer

139 views, Optimization

Event-driven learning classifier systems for online soccer games

15 years 11 months ago

Download www.genetic-programming.org

This paper reports on the application of classifier systems to the acquisition of decision-making algorithms for agents in online soccer games. The objective of this research is t...

Yuji Sato, Ryutaro Kanno

AAAI
2000

147 views, Intelligent Agents

ADVISOR: A Machine Learning Architecture for Intelligent Tutor Construction

15 years 7 months ago

Download www.aaai.org

We have constructed ADVISOR, a two-agent machine learning architecture for intelligent tutoring systems (ITS). The purpose of this architecture is to centralize the reasoning of a...

Joseph Beck, Beverly Park Woolf, Carole R. Beal
