Sciweavers

36 search results - page 6 / 8
» Posterior Weighted Reinforcement Learning with State Uncerta...
PKDD
2010
Springer
Data Mining
13 years 5 months ago
Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations
Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...
Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...
TSMC
2002
13 years 7 months ago
A distributed robotic control system based on a temporal self-organizing neural network
A distributed robot control system is proposed based on a temporal self-organizing neural network, called competitive and temporal Hebbian (CTH) network. The CTH network can learn ...
Guilherme De A. Barreto, Aluizio F. R. Araú...
ECML
2006
Springer
13 years 11 months ago
Efficient Non-linear Control Through Neuroevolution
Many complex control problems are not amenable to traditional controller design. Not only is it difficult to model real systems, but often it is unclear what kind of beha...
Faustino J. Gomez, Jürgen Schmidhuber, Risto ...
ICASSP
2010
IEEE
13 years 7 months ago
HMM-based sequence-to-frame mapping for voice conversion
Voice conversion can be reduced to the problem of finding a transformation function between the corresponding speech sequences of two speakers. Perhaps the most voice conversions meth...
Yu Qiao, Daisuke Saito, Nobuaki Minematsu
ICML
1999
IEEE
14 years 8 months ago
Distributed Value Functions
Many interesting problems, such as power grids, network switches, and traffic flow, that are candidates for solving with reinforcement learning (RL), also have properties that make distri...
Jeff G. Schneider, Weng-Keen Wong, Andrew W. Moore...