Sciweavers

Learning Partially Observable Deterministic Action Models
ATAL
2010
Springer
13 years 7 months ago
PAC-MDP learning with knowledge-based admissible models
PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...
Marek Grzes, Daniel Kudenko
ICRA
2008
IEEE
14 years 2 months ago
Unsupervised body scheme learning through self-perception
In this paper, we present an approach allowing a robot to learn a generative model of its own physical body from scratch using self-perception with a single monocular camera. O...
Jürgen Sturm, Christian Plagemann, Wolfram Bu...
ECAI
2010
Springer
13 years 8 months ago
The Dynamics of Multi-Agent Reinforcement Learning
Infinite-horizon multi-agent control processes with nondeterminism and partial state knowledge have particularly interesting properties with respect to adaptive control, ...
Luke Dickens, Krysia Broda, Alessandra Russo
DATE
2008
IEEE
14 years 2 months ago
A Framework of Stochastic Power Management Using Hidden Markov Model
The effectiveness of stochastic power management relies on the accurate system and workload model and effective policy optimization. Workload modeling is a machine learning proce...
Ying Tan, Qinru Qiu
ICML
1990
IEEE
13 years 11 months ago
Explanations of Empirically Derived Reactive Plans
Given an adequate simulation model of the task environment and payoff function that measures the quality of partially successful plans, competition-based heuristics such as geneti...
Diana F. Gordon, John J. Grefenstette