Sciweavers

1233 search results - page 213 / 247
» Reinforcement learning
Sort
View
122
Voted
NN
2006
Springer
140views Neural Networks» more  NN 2006»
15 years 2 months ago
Neural mechanism for stochastic behaviour during a competitive game
Previous studies have shown that non-human primates can generate highly stochastic choice behaviour, especially when this is required during a competitive interaction with another...
Alireza Soltani, Daeyeol Lee, Xiao-Jing Wang
138
Voted
TSMC
2008
135views more  TSMC 2008»
15 years 2 months ago
Wholesale Power Price Dynamics Under Transmission Line Limits: A Use of an Agent-Based Intelligent Simulator
Abstract--This research proposes a use of an agent-based intelligent simulator to numerically examine the influence of a transmission line limit on the dynamics of a wholesale powe...
Toshiyuki Sueyoshi, Gopalakrishna Reddy Tadiparthi
ML
2002
ACM
133views Machine Learning» more  ML 2002»
15 years 2 months ago
Finite-time Analysis of the Multiarmed Bandit Problem
Reinforcement learning policies face the exploration versus exploitation dilemma, i.e. the search for a balance between exploring the environment to find profitable actions while t...
Peter Auer, Nicolò Cesa-Bianchi, Paul Fisch...
132
Voted
COLT
2010
Springer
15 years 19 days ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
158
Voted
JMLR
2010
141views more  JMLR 2010»
14 years 9 months ago
Pinview: Implicit Feedback in Content-Based Image Retrieval
This paper describes Pinview, a content-based image retrieval system that exploits implicit relevance feedback during a search session. Pinview contains several novel methods that...
Peter Auer, Zakria Hussain, Samuel Kaski, Arto Kla...