Search Sciweavers | Sciweavers

181 search results - page 29 / 37

» On Policy Learning in Restricted Policy Spaces

UAI 2001, 113 views, Artificial Intelligence

Improved learning of Bayesian networks

13 years 9 months ago

Download functionalgenomics.upf.edu

The search space of Bayesian Network structures is usually defined as Acyclic Directed Graphs (DAGs) and the search is done by local transformations of DAGs. But the space of Baye...

Tomás Kocka, Robert Castelo

ICDM 2002, IEEE, 105 views, Data Mining

Empirical Comparison of Various Reinforcement Learning Strategies for Sequential Targeted Marketing

14 years 17 days ago

Download www.weifan.info

We empirically evaluate the performance of various reinforcement learning methods in applications to sequential targeted marketing. In particular, we propose and evaluate a progre...

Naoki Abe, Edwin P. D. Pednault, Haixun Wang, Bian...

ATAL 2008, Springer, 138 views, Intelligent Agents

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 9 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

ICML 2007, IEEE, 204 views, Machine Learning

Constructing basis functions from directed graphs for value function approximation

14 years 8 months ago

Download www.machinelearning.org

Basis functions derived from an undirected graph connecting nearby samples from a Markov decision process (MDP) have proven useful for approximating value functions. The success o...

Jeffrey Johns, Sridhar Mahadevan

NN 2007, Springer, 105 views, Neural Networks

Guiding exploration by pre-existing knowledge without modifying reward

13 years 7 months ago

Download www.cs.hut.fi

Reinforcement learning is based on exploration of the environment and receiving reward that indicates which actions taken by the agent are good and which ones are bad. In many app...

Kary Främling
