Sciweavers

651 search results - page 83 / 131
» Algorithms for Inverse Reinforcement Learning
BC
1998
15 years 2 months ago
Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity
Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are difficult to achieve because the agents incur ...
Javier Zamora, José del R. Millán, A...
RECOMB
2009
Springer
16 years 3 months ago
Learning Models for Aligning Protein Sequences with Predicted Secondary Structure
Accurately aligning distant protein sequences is notoriously difficult. A recent approach to improving alignment accuracy is to use additional information such as predicted seconda...
Eagu Kim, Travis J. Wheeler, John D. Kececioglu
ICML
2000
IEEE
16 years 3 months ago
Eligibility Traces for Off-Policy Policy Evaluation
Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...
Doina Precup, Richard S. Sutton, Satinder P. Singh
NIPS
1996
15 years 3 months ago
Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning
Model learning combined with dynamic programming has been shown to be effective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...
Jeff G. Schneider
STOC
2010
ACM
15 years 6 months ago
Efficiently Learning Mixtures of Two Gaussians
Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate the mixture parameters. We provide a polynomial-time algorithm for this proble...
Adam Tauman Kalai, Ankur Moitra, and Gregory Valia...