Sciweavers

34 search results - page 5 / 7
» Towards Finite-Sample Convergence of Direct Reinforcement Le...
Sort
View
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
14 years 26 days ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
AGENTS
1999
Springer
13 years 11 months ago
Team-Partitioned, Opaque-Transition Reinforcement Learning
In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...
Peter Stone, Manuela M. Veloso
PKDD
2009
Springer
184views Data Mining» more  PKDD 2009»
13 years 11 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
FBIT
2007
IEEE
14 years 1 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
COGSR
2011
109views more  COGSR 2011»
13 years 1 months ago
How groups develop a specialized domain vocabulary: A cognitive multi-agent model
We simulate the evolution of a domain vocabulary in small communities. Empirical data show that human communicators can evolve graphical languages quickly in a constrained task (P...
David Reitter, Christian Lebiere