Sciweavers

1166 search results - page 156 / 234
» Negotiating Using Rewards
ICML
2005
IEEE
14 years 11 months ago
Exploration and apprenticeship learning in reinforcement learning
We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...
Pieter Abbeel, Andrew Y. Ng
IROS
2009
IEEE
14 years 4 months ago
Consideration on robotic giant-swing motion generated by reinforcement learning
This study attempts to make a compact humanoid robot acquire a giant-swing motion without any robotic models by using reinforcement learning; only the interaction with environme...
Masayuki Hara, Naoto Kawabe, Naoki Sakai, Jian Hua...
IROS
2009
IEEE
14 years 4 months ago
Appearance contrast for fast, robust trail-following
We describe a framework for finding and tracking “trails” for autonomous outdoor robot navigation. Through a combination of visual cues and ladar-derived structural inform...
Christopher Rasmussen, Yan Lu, Mehmet Kocamaz
CSE
2008
IEEE
14 years 4 months ago
Adaptation to Dynamic Resource Availability in Ad Hoc Grids through a Learning Mechanism
Ad-hoc Grids are highly heterogeneous and dynamic networks. One of the main challenges of resource allocation in such environments is to find mechanisms which do not rely on the ...
Behnaz Pourebrahimi, Koen Bertels
IJCNN
2007
IEEE
14 years 4 months ago
Predictive E-Mail Server Performability Analysis Based on Fuzzy Arithmetic
The performability of disk array systems has been studied before. However, in the case of imprecise data, a fuzzy model can be the base for the performability analysis. This pa...
Guillermo Navarro, Milos Manic