Sciweavers

» Policy teaching through reward function learning
CORR
2010
Springer
13 years 2 months ago
Online Learning in Opportunistic Spectrum Access: A Restless Bandit Approach
We consider an opportunistic spectrum access (OSA) problem where the time-varying condition of each channel (e.g., as a result of random fading or certain primary users' activ...
Cem Tekin, Mingyan Liu
ICML
2003
IEEE
14 years 8 months ago
Principled Methods for Advising Reinforcement Learning Agents
An important issue in reinforcement learning is how to incorporate expert knowledge in a principled manner, especially as we scale up to real-world tasks. In this paper, we presen...
Eric Wiewiora, Garrison W. Cottrell, Charles Elkan
COLING
2010
13 years 2 months ago
Controlling Listening-oriented Dialogue using Partially Observable Markov Decision Processes
This paper investigates how to automatically create a dialogue control component of a listening agent to reduce the current high cost of manually creating such components. We coll...
Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Min...
IADIS
2004
13 years 9 months ago
Addressing the Effective Use of Learning Objects Through Teacher Education
This paper describes the development and evaluation of a curriculum designed to help teachers learn about and integrate digital library functionalities and learning objects into t...
Mimi Recker
ATAL
2008
Springer
13 years 9 months ago
Teaching multi-robot coordination using demonstration of communication and state sharing
Solutions to complex tasks often require the cooperation of multiple robots; however, developing multi-robot policies can present many challenges. In this work, we introduce teach...
Sonia Chernova, Manuela M. Veloso