Search Sciweavers | Sciweavers

3837 search results - page 81 / 768

» Learning Approximate Consistencies

126

click to vote

ICML
2003
IEEE

157views Machine Learning» more ICML 2003»

Action Elimination and Stopping Conditions for Reinforcement Learning

16 years 4 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning an upper and a lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

claim paper

Read More »

250

click to vote

POPL
2007
ACM

129views Programming Languages» more POPL 2007»

Program verification as probabilistic inference

16 years 4 months ago

Download research.microsoft.com

In this paper, we propose a new algorithm for proving the validity or invalidity of a pre/postcondition pair for a program. The algorithm is motivated by the success of the algori...

Sumit Gulwani, Nebojsa Jojic

claim paper

Read More »

151

click to vote

ML
2002
ACM

140views Machine Learning» more ML 2002»

A Probabilistic Framework for SVM Regression and Error Bar Estimation

15 years 3 months ago

Download eprints.ecs.soton.ac.uk

In this paper, we elaborate on the well-known relationship between Gaussian Processes (GP) and Support Vector Machines (SVM) under some convex assumptions for the loss functions. ...

Junbin Gao, Steve R. Gunn, Chris J. Harris, Martin...

claim paper

Read More »

148

click to vote

ICONIP
2007

141views Information Technology» more ICONIP 2007»

Natural Conjugate Gradient in Variational Inference

15 years 5 months ago

Download eprints.pascal-network.org

Variational methods for approximate inference in machine learning often adapt a parametric probability distribution to optimize a given objective function. This view is especially ...

Antti Honkela, Matti Tornio, Tapani Raiko, Juha Ka...

claim paper

Read More »

146

click to vote

NIPS
2008

165views Information Technology» more NIPS 2008»

Regularized Policy Iteration

15 years 5 months ago

Download webdocs.cs.ualberta.ca

In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...

Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...

claim paper

Read More »

« Prev « First page 81 / 768 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers