Search Sciweavers | Sciweavers

1236 search results - page 209 / 248

» Opposition-Based Reinforcement Learning

193

click to vote

ACSE
2000
ACM

271views Theoretical Computer Science» more ACSE 2000»

The information environments program - a new design based IT degree

15 years 10 months ago

Download www.itee.uq.edu.au

The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...

Michael Docherty, Peter Sutton, Margot Brereton, S...

claim paper

Read More »

163

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 10 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

180

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

156

click to vote

ICML
2005
IEEE

121views Machine Learning» more ICML 2005»

Combining model-based and instance-based learning for first order regression

16 years 7 months ago

Download www.cs.kuleuven.ac.be

T ORDER REGRESSION (EXTENDED ABSTRACT) Kurt Driessensa Saso Dzeroskib a Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz) b Departm...

Kurt Driessens, Saso Dzeroski

claim paper

Read More »

158

click to vote

HICSS
2003
IEEE

116views Biometrics» more HICSS 2003»

Modeling Instrumental Conditioning - The Behavioral Regulation Approach

15 years 11 months ago

Download www.hicss.hawaii.edu

Basically, instrumental conditioning is learning through consequences: Behavior that produces positive results (high “instrumental response”) is reinforced, and that which pro...

Jose J. Gonzalez, Agata Sawicka

claim paper

Read More »

« Prev « First page 209 / 248 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers