Sciweavers

163 search results - page 23 / 33
» Policy Gradient Methods for Robotics
NIPS
1992
13 years 9 months ago
Explanation-Based Neural Network Learning for Robot Control
How can artificial neural nets generalize better from fewer examples? In order to generalize successfully, neural network learning methods typically require large training data se...
Tom M. Mitchell, Sebastian Thrun
AI
1999
Springer
13 years 7 months ago
Cooperative Behavior Acquisition for Mobile Robots in Dynamically Changing Real Worlds Via Vision-Based Reinforcement Learning a
In this paper, we first discuss the meaning of physical embodiment and the complexity of the environment in the context of multi-agent learning. We then propose a vision-based rei...
Minoru Asada, Eiji Uchibe, Koh Hosoda
ICRA
2010
IEEE
13 years 6 months ago
Variable resolution decomposition for robotic navigation under a POMDP framework
Partially Observable Markov Decision Processes (POMDPs) offer a powerful mathematical framework for making optimal action choices in noisy and/or uncertain environments, in par...
Robert Kaplow, Amin Atrash, Joelle Pineau
CORR
2011
Springer
13 years 2 months ago
Active Markov Information-Theoretic Path Planning for Robotic Environmental Sensing
Recent research in multi-robot exploration and mapping has focused on sampling environmental fields, which are typically modeled using the Gaussian process (GP). Existing informa...
Kian Hsiang Low, John M. Dolan, Pradeep K. Khosla
IROS
2007
IEEE
14 years 2 months ago
Hysteretic Q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams
Multi-agent systems (MAS) are a field of study of growing interest in a variety of domains such as robotics or distributed controls. The article focuses on decentralized reinf...
Laëtitia Matignon, Guillaume J. Laurent, Nadi...