Sciweavers

1880 search results - page 122 / 376
» Robust Learning - Rich and Poor
Sort
View
VISAPP
2008
15 years 6 months ago
Towards the Estimation of Conspicuity with Visual Priors
Traffic signs are designed to be clearly seen by drivers. However a little is known about the visual influence of the traffic sign environment on how it will be perceived. Computer...
Ludovic Simon, Jean-Philippe Tarel, Roland Bremond
JMLR
2010
139views more  JMLR 2010»
14 years 11 months ago
Tempered Markov Chain Monte Carlo for training of Restricted Boltzmann Machines
Alternating Gibbs sampling is the most common scheme used for sampling from Restricted Boltzmann Machines (RBM), a crucial component in deep architectures such as Deep Belief Netw...
Guillaume Desjardins, Aaron C. Courville, Yoshua B...
128
Voted
GECCO
2009
Springer
135views Optimization» more  GECCO 2009»
15 years 11 months ago
Neuroevolutionary reinforcement learning for generalized helicopter control
Helicopter hovering is an important challenge problem in the field of reinforcement learning. This paper considers several neuroevolutionary approaches to discovering robust cont...
Rogier Koppejan, Shimon Whiteson
165
Voted
AAMAS
2007
Springer
15 years 10 months ago
Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game
Abstract. The application of reinforcement learning algorithms to multiagent domains may cause complex non-convergent dynamics. The replicator dynamics, commonly used in evolutiona...
Alessandro Lazaric, Jose Enrique Munoz de Cote, Fa...
146
Voted
ICML
1996
IEEE
15 years 8 months ago
Discovering Structure in Multiple Learning Tasks: The TC Algorithm
Recently, there has been an increased interest in "lifelong" machine learning methods, that transfer knowledge across multiple learning tasks. Such methods have repeated...
Sebastian Thrun, Joseph O'Sullivan