Search Sciweavers | Sciweavers

651 search results - page 83 / 131

» Algorithms for Inverse Reinforcement Learning

161

click to vote

BC
1998

109views more BC 1998»

Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity

15 years 5 months ago

Download lis.epfl.ch

Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are dicult to achieve because the agents incur ...

Javier Zamora, José del R. Millán, A...

claim paper

Read More »

202

click to vote

RECOMB
2009
Springer

214views Computational Biology» more RECOMB 2009»

Learning Models for Aligning Protein Sequences with Predicted Secondary Structure

16 years 6 months ago

Download www.cs.arizona.edu

Accurately aligning distant protein sequences is notoriously difficult. A recent approach to improving alignment accuracy is to use additional information such as predicted seconda...

Eagu Kim, Travis J. Wheeler, John D. Kececioglu

claim paper

Read More »

185

click to vote

ICML
2000
IEEE

153views Machine Learning» more ICML 2000»

Eligibility Traces for Off-Policy Policy Evaluation

16 years 7 months ago

Download www.cs.ualberta.ca

Eligibility traces have been shown to speed reinforcement learning, to make it more robust to hidden states, and to provide a link between Monte Carlo and temporal-difference meth...

Doina Precup, Richard S. Sutton, Satinder P. Singh

claim paper

Read More »

160

click to vote

NIPS
1996

112views Information Technology» more NIPS 1996»

Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning

15 years 7 months ago

Download www.ri.cmu.edu

Model learning combined with dynamic programming has been shown to be e ective for learning control of continuous state dynamic systems. The simplest method assumes the learned mod...

Jeff G. Schneider

claim paper

Read More »

152

click to vote

STOC
2010
ACM

195views Algorithms» more STOC 2010»

Efficiently Learning Mixtures of Two Gaussians

15 years 10 months ago

Download www.cs.berkeley.edu

Given data drawn from a mixture of multivariate Gaussians, a basic problem is to accurately estimate the mixture parameters. We provide a polynomial-time algorithm for this proble...

Adam Tauman Kalai, Ankur Moitra, and Gregory Valia...

claim paper

Read More »

« Prev « First page 83 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers