Sciweavers

1138 search results - page 159 / 228
» Feature Markov Decision Processes
ICASSP
2011
IEEE
14 years 7 months ago
Estimation of ordinal approach-avoidance labels in dyadic interactions: Ordinal logistic regression approach
Behavioral Signal Processing aims at automating behavioral coding schemes such as those prevalent in psychology and mental health research. This paper describes methods to quantif...
Viktor Rozgic, Bo Xiao, Athanasios Katsamanis, Bri...
DEXA
2010
Springer
15 years 4 months ago
Enhanced Foundry Production Control
Mechanical properties are the attributes that measure the faculty of a metal to withstand several loads and tensions. Specifically, ultimate tensile strength is the force a materia...
Javier Nieves, Igor Santos, Yoseba K. Penya, Felix...
ICML
2009
IEEE
16 years 4 months ago
Predictive representations for policy gradient in POMDPs
We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...
Abdeslam Boularias, Brahim Chaib-draa
ICML
2007
IEEE
16 years 4 months ago
Learning state-action basis functions for hierarchical MDPs
This paper introduces a new approach to action-value function approximation by learning basis functions from a spectral decomposition of the state-action manifold. This paper exten...
Sarah Osentoski, Sridhar Mahadevan
ICML
2007
IEEE
16 years 4 months ago
Conditional random fields for multi-agent reinforcement learning
Conditional random fields (CRFs) are graphical models for modeling the probability of labels given the observations. They have traditionally been trained using a set of obser...
Xinhua Zhang, Douglas Aberdeen, S. V. N. Vishwanat...