Search Sciweavers | Sciweavers

1512 search results - page 221 / 303

» Qualitative reinforcement learning

ACSE
2000
ACM

271 views, Theoretical Computer Science

The information environments program - a new design based IT degree

14 years 1 months ago

Download www.itee.uq.edu.au

The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...

Michael Docherty, Peter Sutton, Margot Brereton, S...

ICCS
1993
Springer

99 views, Applied Computing

Towards Domain-Independent Machine Intelligence

14 years 1 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS) is a learning system framework which, given little initial domain knowledge, increases its decision-making abilities in complex problem domains....

Robert Levinson

NIPS
2008

110 views, Information Technology

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

13 years 10 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

ICML
2005
IEEE

121 views, Machine Learning

Combining model-based and instance-based learning for first order regression

14 years 10 months ago

Download www.cs.kuleuven.ac.be

Extended abstract. Kurt Driessens (a), Saso Dzeroski (b). (a) Department of Computer Science, University of Waikato, Hamilton, New Zealand (kurtd@waikato.ac.nz); (b) Departm...

Kurt Driessens, Saso Dzeroski

DSMML
2004
Springer

170 views, Machine Learning

Can Gaussian Process Regression Be Made Robust Against Model Mismatch?

14 years 2 months ago

Download eprints.pascal-network.org

Learning curves for Gaussian process (GP) regression can be strongly affected by a mismatch between the ‘student’ model and the ‘teacher’ (true data generation process), e...

Peter Sollich
