Sciweavers

» Interactive optimization in cooperative environments
ICML
2008
IEEE
14 years 10 months ago
Non-parametric policy gradients: a unified treatment of propositional and relational domains
Policy gradient approaches are a powerful instrument for learning how to interact with the environment. Existing approaches have focused on propositional and continuous domains on...
Kristian Kersting, Kurt Driessens
ICML
2006
IEEE
14 years 10 months ago
Qualitative reinforcement learning
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
Arkady Epshteyn, Gerald DeJong
ICRA
2007
IEEE
14 years 3 months ago
Towards a Real-Time Bayesian Imitation System for a Humanoid Robot
Imitation learning, or programming by demonstration (PbD), holds the promise of allowing robots to acquire skills from humans with domain-specific knowledge, who nonet...
Aaron P. Shon, Joshua J. Storz, Rajesh P. N. Rao
SECON
2007
IEEE
14 years 3 months ago
Self-Learning Repeated Game Framework for Distributed Primary-Prioritized Dynamic Spectrum Access
Dynamic spectrum access has become a promising approach to fully utilize the scarce spectrum resources. In a dynamically changing spectrum environment, it is very important to desi...
Beibei Wang, Zhu Ji, K. J. Ray Liu
CCGRID
2006
IEEE
14 years 3 months ago
Integrating Gridcomputing and Metamodeling
Simulation and optimization of complex mechanical and electronical systems is a very time consuming and computationally intensive task. Therefore, metamodeling techniques are ofte...
Dirk Gorissen, Wouter Hendrickx, Karel Crombecq, T...