Sciweavers

373 search results - page 11 / 75
» Covariant Policy Search
Sort
View
INFORMS
2007
40views more  INFORMS 2007»
13 years 7 months ago
An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes
Jiaqiao Hu, Michael C. Fu, Vahid Reza Ramezani, St...
ICML
2009
IEEE
14 years 8 months ago
Monte-Carlo simulation balancing
In this paper we introduce the first algorithms for efficiently learning a simulation policy for Monte-Carlo search. Our main idea is to optimise the balance of a simulation polic...
David Silver, Gerald Tesauro
JAL
2002
113views more  JAL 2002»
13 years 7 months ago
A multivariate view of random bucket digital search trees
We take a multivariate view of digital search trees by studying the number of nodes of different types that may coexist in a bucket digital search tree as it grows under an arbitr...
Friedrich Hubalek, Hsien-Kuei Hwang, William Lew, ...
ICIP
2008
IEEE
14 years 9 months ago
Kernel-based high-dimensional histogram estimation for visual tracking
We propose an approach for non-rigid tracking that represents objects by their set of distribution parameters. Compared to joint histogram representations, a set of parameters suc...
Allen Tannenbaum, James G. Malcolm, Peter Karasev