Search Sciweavers | Sciweavers

179 search results - page 14 / 36

» Phase Transitions in Relational Learning

182

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 7 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

149

click to vote

LPNMR
2007
Springer

112views Automated Reasoning» more LPNMR 2007»

On the Effectiveness of Looking Ahead in Search for Answer Sets

16 years 25 days ago

Download webdocs.cs.ualberta.ca

Abstract. Most complete answer set solvers are based on DPLL. One of the constraint propagation methods is the so-called lookahead, which has been somewhat controversial, due to it...

Guohua Liu, Jia-Huai You

claim paper

Read More »

167

click to vote

CORR
2010
Springer

114views Education» more CORR 2010»

On the Stability of Empirical Risk Minimization in the Presence of Multiple Risk Minimizers

15 years 6 months ago

Download www.cs.berkeley.edu

Abstract--Recently Kutin and Niyogi investigated several notions of algorithmic stability--a property of a learning map conceptually similar to continuity--showing that training-st...

Benjamin I. P. Rubinstein, Aleksandr Simma

claim paper

Read More »

199

click to vote

FSS
2008

110views more FSS 2008»

Learning valued preference structures for solving classification problems

15 years 6 months ago

Download www.mathematik.uni-marburg.de

This paper introduces a new approach to classification which combines pairwise decomposition techniques with ideas and tools from fuzzy preference modeling. More specifically, our...

Eyke Hüllermeier, Klaus Brinker

claim paper

Read More »

190

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms

15 years 8 months ago

Download www.cis.upenn.edu

In this paper, we address two issues of long-standing interest in the reinforcement learning literature. First, what kinds of performance guarantees can be made for Q-learning aft...

Michael J. Kearns, Satinder P. Singh

claim paper

Read More »

« Prev « First page 14 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers