Search Sciweavers | Sciweavers

3628 search results - page 190 / 726

» The Decision Diffie-Hellman Problem

109

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 7 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

121

Voted

COLT
2004
Springer

161views Machine Learning» more COLT 2004»

Oracle Bounds and Exact Algorithm for Dyadic Classification Trees

15 years 7 months ago

Download ml.cs.tu-berlin.de

This paper introduces a new method using dyadic decision trees for estimating a classification or a regression function in a multiclass classification problem. The estimator is bas...

Gilles Blanchard, Christin Schäfer, Yves Roze...

claim paper

Read More »

119

Voted

ECAI
2004
Springer

123views Artificial Intelligence» more ECAI 2004»

Many Hands Make Light Work: Localized Satisfiability for Multi-Context Systems

15 years 7 months ago

Download dit.unitn.it

In this paper, we tackle the satisfiability problem for multi-context systems. First, we establish a satisfiability algorithm based on an encoding into propositional logic. Then, w...

Floris Roelofsen, Luciano Serafini, Alessandro Cim...

claim paper

Read More »

131

click to vote

ECML
2006
Springer

112views Machine Learning» more ECML 2006»

Bandit Based Monte-Carlo Planning

15 years 7 months ago

Download www.lri.fr

Abstract. For large state-space Markovian Decision Problems MonteCarlo planning is one of the few viable approaches to find near-optimal solutions. In this paper we introduce a new...

Levente Kocsis, Csaba Szepesvári

claim paper

Read More »

132

Voted

NIPS
2004

125views Information Technology» more NIPS 2004»

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

15 years 4 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

« Prev « First page 190 / 726 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers