Sciweavers

3381 search results - page 202 / 677
» LEO - DB2's LEarning Optimizer
NIPS
2007
15 years 6 months ago
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
We consider apprenticeship learning—learning from expert demonstrations—in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert...
J. Zico Kolter, Pieter Abbeel, Andrew Y. Ng
BC
1998
15 years 4 months ago
Learning and stabilization of altruistic behaviors in multi-agent systems by reciprocity
Optimization of performance in collective systems often requires altruism. The emergence and stabilization of altruistic behaviors are difficult to achieve because the agents incur ...
Javier Zamora, José del R. Millán, A...
ML
2007
ACM
15 years 4 months ago
Unconditional lower bounds for learning intersections of halfspaces
We prove new lower bounds for learning intersections of halfspaces, one of the most important concept classes in computational learning theory. Our main result is that any statist...
Adam R. Klivans, Alexander A. Sherstov
GECCO
2009
Springer
15 years 2 months ago
Uncertainty handling CMA-ES for reinforcement learning
The covariance matrix adaptation evolution strategy (CMA-ES) has proven to be a powerful method for reinforcement learning (RL). Recently, the CMA-ES has been augmented with an ada...
Verena Heidrich-Meisner, Christian Igel
AIM
2011
14 years 8 months ago
Transfer Learning by Reusing Structured Knowledge
Transfer learning aims to solve new learning problems by extracting and making use of the common knowledge found in related domains. A key element of transfer learning is to ident...
Qiang Yang, Vincent Wenchen Zheng, Bin Li, Hankz H...