Sciweavers

1227 search results - page 187 / 246
» Learning Rates for Q-Learning
Sort
View
PRICAI
2010
Springer
13 years 8 months ago
Visual Query Expansion via Incremental Hypernetwork Models of Image and Text
Abstract. Humans can associate vision and language modalities and thus generate mental imagery, i.e. visual images, from linguistic input in an environment of unlimited inflowing i...
Min-Oh Heo, Myunggu Kang, Byoung-Tak Zhang
COLT
2010
Springer
13 years 8 months ago
Best Arm Identification in Multi-Armed Bandits
We consider the problem of finding the best arm in a stochastic multi-armed bandit game. The regret of a forecaster is here defined by the gap between the mean reward of the optim...
Jean-Yves Audibert, Sébastien Bubeck, R&eac...
ICANN
2010
Springer
13 years 8 months ago
Dynamics and Function of a CA1 Model of the Hippocampus during Theta and Ripples
The hippocampus is known to be involved in spatial learning in rats. Spatial learning involves the encoding and replay of temporally sequenced spatial information. Temporally seque...
Vassilis Cutsuridis, Michael E. Hasselmo
ML
2010
ACM
138views Machine Learning» more  ML 2010»
13 years 4 months ago
Mining adversarial patterns via regularized loss minimization
Traditional classification methods assume that the training and the test data arise from the same underlying distribution. However, in several adversarial settings, the test set is...
Wei Liu, Sanjay Chawla
ICASSP
2011
IEEE
13 years 1 months ago
Application specific loss minimization using gradient boosting
Gradient boosting is a flexible machine learning technique that produces accurate predictions by combining many weak learners. In this work, we investigate its use in two applica...
Bin Zhang, Abhinav Sethy, Tara N. Sainath, Bhuvana...