Search Sciweavers | Sciweavers

AAAI
2006

Intelligent Agents

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

AAAI
2006

Intelligent Agents

Simultaneous Team Assignment and Behavior Recognition from Spatio-Temporal Agent Traces

Download www.cs.cmu.edu

This paper addresses the problem of activity recognition for physically-embodied agent teams. We define team activity recognition as the process of identifying team behaviors from...

Gita Sukthankar, Katia P. Sycara

AAAI
2006

Intelligent Agents

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

AAAI
2006

Intelligent Agents

QUICR-Learning for Multi-Agent Coordination

Download www.aaai.org

Coordinating multiple agents that need to perform a sequence of actions to maximize a system level reward requires solving two distinct credit assignment problems. First, credit m...

Adrian K. Agogino, Kagan Tumer

AAAI
2006

Intelligent Agents

Goal Specification, Non-Determinism and Quantifying over Policies

Download www.public.asu.edu

One important aspect in directing cognitive robots or agents is to formally specify what is expected of them. This is often referred to as goal specification. Temporal logics such...

Chitta Baral, Jicheng Zhao
