Search Sciweavers | Sciweavers

165 search results - page 31 / 33

» Exploration and apprenticeship learning in reinforcement lea...

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

13 years 1 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

13 years 5 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

click to vote

ICANN
2005
Springer

142views Neural Networks» more ICANN 2005»

Evolving Modular Fast-Weight Networks for Control

14 years 14 days ago

Download www.idsia.ch

Abstract. In practice, almost all control systems in use today implement some form of linear control. However, there are many tasks for which conventional control engineering metho...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

click to vote

ECML
2003
Springer

129views Machine Learning» more ECML 2003»

Optimising Performance of Competing Search Engines in Heterogeneous Web Environments

14 years 5 days ago

Download userweb.port.ac.uk

Abstract. Distributed heterogeneous search environments are an emerging phenomenon in Web search, in which topic-speciﬁc search engines provide search services, and metasearchers...

Rinat Khoussainov, Nicholas Kushmerick

claim paper

Read More »

click to vote

ICRA
2010
IEEE

162views Robotics» more ICRA 2010»

Adaptive multi-robot coordination: A game-theoretic perspective

13 years 5 months ago

Download teamcore.usc.edu

Multi-robot systems researchers have been investigating adaptive coordination methods for improving spatial coordination in teams. Such methods adapt the coordination method to th...

Gal A. Kaminka, Dan Erusalimchik, Sarit Kraus

claim paper

Read More »

« Prev « First page 31 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers