Sciweavers

1166 search results - page 93 / 234
» Negotiating Using Rewards
Sort
View
ICRA
2009
IEEE
143views Robotics» more  ICRA 2009»
14 years 4 months ago
Least absolute policy iteration for robust value function approximation
Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efficiency. However, it tends to be sensitive to outliers...
Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...
BMCV
2000
Springer
14 years 2 months ago
Unsupervised Learning of Biologically Plausible Object Recognition Strategies
Recent psychological and neurological evidence suggests that biological object recognition is a process of matching sensed images to stored iconic memories. This paper presents a p...
Bruce A. Draper, Kyungim Baek
ICALT
2007
IEEE
13 years 12 months ago
Evaluating the automatic and manual creation process of adaptive lessons
Using adaptive, personalized courses is rewarding, as it can create a better learning experience, tailored for a specific learner’s needs. The process of creating these courses,...
Maurice Hendrix, Alexandra I. Cristea, Mike Joy
AAMAS
2002
Springer
13 years 10 months ago
Relational Reinforcement Learning for Agents in Worlds with Objects
In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it ...
Saso Dzeroski
NIPS
2001
13 years 11 months ago
Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning
Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...
Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...