Sciweavers

827 search results - page 63 / 166
» Variational methods for Reinforcement Learning
Sort
View
IROS
2006
IEEE
187views Robotics» more  IROS 2006»
14 years 1 months ago
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic
— Recently, many researchers on humanoid robotics are interested in Quasi-Passive-Dynamic Walking (Quasi-PDW) which is similar to human walking. It is desirable that control para...
Tsuyoshi Ueno, Yutaka Nakamura, Takashi Takuma, To...
IROS
2008
IEEE
165views Robotics» more  IROS 2008»
14 years 2 months ago
Mutual development of behavior acquisition and recognition based on value system
Abstract. Both self-learning architecture (embedded structure) and explicit/implicit teaching from other agents (environmental design issue) are necessary not only for one behavior...
Yasutake Takahashi, Yoshihiro Tamura, Minoru Asada
ATAL
2010
Springer
13 years 8 months ago
PAC-MDP learning with knowledge-based admissible models
PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...
Marek Grzes, Daniel Kudenko
CORR
2011
Springer
150views Education» more  CORR 2011»
13 years 2 months ago
Total variation regularization for fMRI-based prediction of behaviour
—While medical imaging typically provides massive amounts of data, the extraction of relevant information for predictive diagnosis remains a difficult challenge. Functional MRI ...
Vincent Michel, Alexandre Gramfort, Gaël Varo...
ESANN
2008
13 years 9 months ago
Improvement in Game Agent Control Using State-Action Value Scaling
The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...
Leo Galway, Darryl Charles, Michaela M. Black