Sciweavers

377 search results - page 44 / 76
» Optimizing Production Manufacturing Using Reinforcement Lear...
Sort
View
AUSAI
2005
Springer
14 years 2 months ago
Adaptive Utility-Based Scheduling in Resource-Constrained Systems
This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...
David Vengerov
NIPS
2008
13 years 10 months ago
Structure Learning in Human Sequential Decision-Making
We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...
Daniel Acuña, Paul R. Schrater
CORR
2010
Springer
204views Education» more  CORR 2010»
13 years 7 months ago
Predictive State Temporal Difference Learning
We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...
Byron Boots, Geoffrey J. Gordon
ETS
2000
IEEE
90views Hardware» more  ETS 2000»
13 years 8 months ago
Role of Contracts in Enhancing Community Building in Web Courses
Project-based work via telecommunications requires the instructor and the students to take explicit steps to create an on-line community that is focused on high quality learning a...
Karen L. Murphy, Sue E. Mahoney, Tina J. Harvell
COLT
2010
Springer
13 years 6 months ago
An Asymptotically Optimal Bandit Algorithm for Bounded Support Models
Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...
Junya Honda, Akimichi Takemura