Sciweavers

1027 search results - page 89 / 206
» Learning Similar Tasks From Observation and Practice
Sort
View
126
Voted
ECML
2007
Springer
15 years 10 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
144
Voted
KDD
2008
ACM
259views Data Mining» more  KDD 2008»
16 years 4 months ago
Using ghost edges for classification in sparsely labeled networks
We address the problem of classification in partially labeled networks (a.k.a. within-network classification) where observed class labels are sparse. Techniques for statistical re...
Brian Gallagher, Hanghang Tong, Tina Eliassi-Rad, ...
135
Voted
JMLR
2010
136views more  JMLR 2010»
14 years 10 months ago
Conceptual Imitation Learning: An Application to Human-robot Interaction
In general, imitation is imprecisely used to address different levels of social learning from high level knowledge transfer to low level regeneration of motor commands. However, t...
Hossein Hajimirsadeghi, Majid Nili Ahmadabadi, Mos...
156
Voted
ACL
2009
15 years 1 months ago
A Graph-based Semi-Supervised Learning for Question-Answering
We present a graph-based semi-supervised learning for the question-answering (QA) task for ranking candidate sentences. Using textual entailment analysis, we obtain entailment sco...
Asli Çelikyilmaz, Marcus Thint, Zhiheng Hua...
133
Voted
ICPR
2000
IEEE
16 years 4 months ago
Image Distance Using Hidden Markov Models
We describe a method for learning statistical models of images using a second-order hidden Markov mesh model. First, an image can be segmented in a way that best matches its stati...
Daniel DeMenthon, David S. Doermann, Marc Vuilleum...