Sciweavers

ATAL
2010
Springer
13 years 11 months ago
Cultivating desired behaviour: policy teaching via environment-dynamics tweaks
In this paper we study, for the first time explicitly, the implications of endowing an interested party (i.e. a teacher) with the ability to modify the underlying dynamics of the ...
Zinovi Rabinovich, Lachlan Dufton, Kate Larson, Ni...
JMLR
2010
13 years 5 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
CI
2005
13 years 10 months ago
Incremental Learning of Procedural Planning Knowledge in Challenging Environments
Autonomous agents that learn about their environment can be divided into two broad classes. One class of existing learners, reinforcement learners, typically employ weak learning ...
Douglas J. Pearson, John E. Laird
CCS
2009
ACM
14 years 5 months ago
Inferring privacy policies for social networking services
Social networking sites have come under criticism for their poor privacy protection track record. Yet, there is an inherent difficulty in deciding which principals should have acc...
George Danezis
ICTAI
2006
IEEE
14 years 4 months ago
MI-Winnow: A New Multiple-Instance Learning Algorithm
We present MI-Winnow, a new multiple-instance learning (MIL) algorithm that provides a new technique to convert MIL data into standard supervised data. In MIL each example is a co...
Sharath R. Cholleti, Sally A. Goldman, Rouhollah R...