Search Sciweavers | Sciweavers

50 search results - page 5 / 10

» Nonparametric Return Distribution Approximation for Reinforc...

ICML
2007
IEEE

Bayesian actor-critic algorithms

16 years 3 months ago

Download www.machinelearning.org

We present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning, is used. Such critics model ...

Mohammad Ghavamzadeh, Yaakov Engel

RSS
2007

Gaussian Beam Processes: A Nonparametric Bayesian Measurement Model for Range Finders

15 years 3 months ago

Download www.roboticsproceedings.org

In probabilistic mobile robotics, the development of measurement models plays a crucial role as it directly influences the efficiency and the robustness of the robot’s perf...

Christian Plagemann, Kristian Kersting, Patrick Pf...

COMCOM
2008

A dynamic routing protocol for keyword search in unstructured peer-to-peer networks

15 years 2 months ago

Download www.cc.gatech.edu

The idea of building query-oriented routing indices has changed the way of improving keyword search efficiency from the basis as it can learn the content distribution from the que...

Cong Shi, Dingyi Han, Yuanjie Liu, Shicong Meng, Y...

ATAL
2008
Springer

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

15 years 4 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

ECML
2004
Springer

Batch Reinforcement Learning with State Importance

15 years 7 months ago

Download www.research.rutgers.edu

We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner
