Sciweavers

827 search results - page 85 / 166
» Variational methods for Reinforcement Learning

Publication
14 years 4 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
MSWIM
2005
ACM
14 years 1 month ago
A swarm intelligent multi-path routing for multimedia traffic over mobile ad hoc networks
In the last few years, the advance of multimedia applications has prompted researchers to undertake the task of routing multimedia data through MANETs. This task is rather difficul...
Saida Ziane, Abdelhamid Mellouk
JCST
2010
13 years 2 months ago
The Inverse Classification Problem
In this paper, we examine an emerging variation of the classification problem, which is known as the inverse classification problem. In this problem, we determine the features to b...
Charu C. Aggarwal, Chen Chen, Jiawei Han
ACL
2010
13 years 5 months ago
Learning Common Grammar from Multilingual Corpus
We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purp...
Tomoharu Iwata, Daichi Mochihashi, Hiroshi Sawada
JMLR
2010
13 years 2 months ago
Gaussian Processes for Machine Learning (GPML) Toolbox
The GPML toolbox provides a wide range of functionality for Gaussian process (GP) inference and prediction. GPs are specified by mean and covariance functions; we offer a library ...
Carl Edward Rasmussen, Hannes Nickisch