Sciweavers

453 search results - page 40 / 91
» Learning from actions not taken: a multiagent learning algor...
Sort
View
130
Voted
ICML
2005
IEEE
16 years 4 months ago
A causal approach to hierarchical decomposition of factored MDPs
We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...
Anders Jonsson, Andrew G. Barto
141
Voted
JCP
2008
139views more  JCP 2008»
15 years 3 months ago
Agent Learning in Relational Domains based on Logical MDPs with Negation
In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...
Song Zhiwei, Chen Xiaoping, Cong Shuang
129
Voted
ICMAS
1998
15 years 5 months ago
How to Explore your Opponent's Strategy (almost) Optimally
This work presents a lookahead-based exploration strategy for a model-based learning agent that enables exploration of the opponent's behavior during interaction in a multi-a...
David Carmel, Shaul Markovitch
140
Voted
ICES
1998
Springer
131views Hardware» more  ICES 1998»
15 years 7 months ago
Aspects of Digital Evolution: Geometry and Learning
In this paper we present a new chromosome representation for evolving digital circuits. The representation is based very closely on the chip architecture of the Xilinx 6216 FPGA. W...
Julian F. Miller, Peter Thomson
163
Voted
BMVC
2010
15 years 1 months ago
Histogram of Body Poses and Spectral Regression Discriminant Analysis for Human Action Categorization
This paper explores a recently proposed and rarely reported subspace learning method, Spectral Regression Discriminant Analysis (SRDA) [1, 2], on silhouette based human action rec...
Ling Shao, Xiuli Chen