Sciweavers

377 search results - page 51 / 76
» aaai 2006
Sort
View
AAAI
2006
13 years 9 months ago
Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains
We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...
Vishal Soni, Satinder P. Singh
AAAI
2006
13 years 9 months ago
Simultaneous Team Assignment and Behavior Recognition from Spatio-Temporal Agent Traces
This paper addresses the problem of activity recognition for physically-embodied agent teams. We define team activity recognition as the process of identifying team behaviors from...
Gita Sukthankar, Katia P. Sycara
AAAI
2006
13 years 9 months ago
Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning
Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...
Shimon Whiteson, Peter Stone
AAAI
2006
13 years 9 months ago
QUICR-Learning for Multi-Agent Coordination
Coordinating multiple agents that need to perform a sequence of actions to maximize a system level reward requires solving two distinct credit assignment problems. First, credit m...
Adrian K. Agogino, Kagan Tumer
AAAI
2006
13 years 9 months ago
Goal Specification, Non-Determinism and Quantifying over Policies
One important aspect in directing cognitive robots or agents is to formally specify what is expected of them. This is often referred to as goal specification. Temporal logics such...
Chitta Baral, Jicheng Zhao