» Probabilistic Algorithms in Robotics

NN
2010
Springer

125views Neural Networks» more NN 2010»

Parameter-exploring policy gradients

15 years 2 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

15 years 5 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

IANDC
2008

92views more IANDC 2008»

Tree exploration with advice

15 years 4 months ago

Download www.liafa.jussieu.fr

We study the amount of knowledge about the network that is required in order to efficiently solve a task concerning this network. The impact of available information on the effici...

Pierre Fraigniaud, David Ilcinkas, Andrzej Pelc

CP
2010
Springer

136views Artificial Intelligence» more CP 2010»

A Box-Consistency Contractor Based on Extremal Functions

15 years 2 months ago

Download www-sop.inria.fr

Abstract. Interval-based methods can approximate all the real solutions of a system of equations and inequalities. The Box interval constraint propagation algorithm enforces Box co...

Gilles Trombettoni, Yves Papegay, Gilles Chabert, ...

CRV
2009
IEEE

115views Robotics» more CRV 2009»

Learning Model Complexity in an Online Environment

15 years 11 months ago

Download cbcl.mit.edu

In this paper we introduce the concept and method for adaptively tuning the model complexity in an online manner as more examples become available. Challenging classification pro...

Dan Levi, Shimon Ullman
