Sciweavers

Reinforcement Learning with Time
ATAL
2006
Springer
14 years 18 days ago
Convergence analysis for collective vocabulary development
We study how decentralized agents can develop a shared vocabulary without global coordination. Answering this question can help us understand the emergence of many communication s...
Jun Wang, Les Gasser, Jim Houk
JACIII
2006
13 years 8 months ago
Opposition-Based Reinforcement Learning
In this paper, a method for image segmentation using an opposition-based reinforcement learning scheme is introduced. We use this agent-based approach to optimally find the appropri...
Hamid R. Tizhoosh
ICML
2004
IEEE
14 years 9 months ago
Dynamic abstraction in reinforcement learning via clustering
Shie Mannor (shie@mit.edu), Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA ...
Shie Mannor, Ishai Menache, Amit Hoze, Uri Klein
ICML
2002
IEEE
14 years 9 months ago
Discovering Hierarchy in Reinforcement Learning with HEXQ
An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...
Bernhard Hengst
ECML
2006
Springer
13 years 10 months ago
Reinforcement Learning for MDPs with Constraints
In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...
Peter Geibel