Sciweavers

34 search results - page 5 / 7
» Towards Finite-Sample Convergence of Direct Reinforcement Le...
Sort
View
137
Voted
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
15 years 10 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
AGENTS
1999
Springer
15 years 8 months ago
Team-Partitioned, Opaque-Transition Reinforcement Learning
In this paper, we present a novel multi-agent learning paradigm called team-partitioned, opaque-transition reinforcement learning (TPOT-RL). TPOT-RL introduces the concept of usin...
Peter Stone, Manuela M. Veloso
PKDD
2009
Springer
184views Data Mining» more  PKDD 2009»
15 years 8 months ago
Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm
Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...
Philippe Rolet, Michèle Sebag, Olivier Teyt...
131
Voted
FBIT
2007
IEEE
15 years 10 months ago
Learning to Drive a Real Car in 20 Minutes
The paper describes our first experiments on Reinforcement Learning to steer a real robot car. The applied method, Neural Fitted Q Iteration (NFQ) is purely data-driven based on ...
Martin Riedmiller, Michael Montemerlo, Hendrik Dah...
COGSR
2011
109views more  COGSR 2011»
14 years 11 months ago
How groups develop a specialized domain vocabulary: A cognitive multi-agent model
We simulate the evolution of a domain vocabulary in small communities. Empirical data show that human communicators can evolve graphical languages quickly in a constrained task (P...
David Reitter, Christian Lebiere