Sciweavers

109 search results - page 19 / 22
» Policy teaching through reward function learning
Sort
View
FLAIRS
2000
13 years 9 months ago
A Multiagent Architecture to Support Distance Learning Personalization on the Web
This article presents the description of the objectives, the structure and the functionality of an interactive system intended to focus the teaching on the performanceof the stude...
Jesus Boticario, Elena Gaudioso
MSE
2005
IEEE
133views Hardware» more  MSE 2005»
14 years 1 months ago
Embedded System Design with FPGAs Using HDLs (Lessons Learned and Pitfalls to Be Avoided)
This paper describes the authors experience with teaching VHDL (and more recently, Verilog) to undergraduate and graduate students at WPI and to engineers through various short co...
R. James Duckworth
AAAI
2008
13 years 10 months ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
ICML
1994
IEEE
13 years 11 months ago
A Modular Q-Learning Architecture for Manipulator Task Decomposition
Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to performcomposite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...
Chen K. Tham, Richard W. Prager
NIPS
2007
13 years 9 months ago
Bayes-Adaptive POMDPs
Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...