Sciweavers

165 search results - page 31 / 33
» Exploration and apprenticeship learning in reinforcement lea...
JAIR
2011
13 years 1 month ago
A Monte-Carlo AIXI Approximation
This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...
Joel Veness, Kee Siong Ng, Marcus Hutter, William ...
COLT
2010
Springer
13 years 5 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
The multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura
ICANN
2005
Springer
14 years 14 days ago
Evolving Modular Fast-Weight Networks for Control
Abstract. In practice, almost all control systems in use today implement some form of linear control. However, there are many tasks for which conventional control engineering metho...
Faustino J. Gomez, Jürgen Schmidhuber
ECML
2003
Springer
14 years 5 days ago
Optimising Performance of Competing Search Engines in Heterogeneous Web Environments
Abstract. Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-specific search engines provide search services, and metasearchers...
Rinat Khoussainov, Nicholas Kushmerick
ICRA
2010
IEEE
13 years 5 months ago
Adaptive multi-robot coordination: A game-theoretic perspective
Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...
Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus