Search Sciweavers | Sciweavers

377 search results - page 44 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

182

click to vote

AUSAI
2005
Springer

166views Artificial Intelligence» more AUSAI 2005»

Adaptive Utility-Based Scheduling in Resource-Constrained Systems

16 years 12 days ago

Download labs.oracle.com

This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...

David Vengerov

claim paper

Read More »

183

click to vote

NIPS
2008

129views Information Technology» more NIPS 2008»

Structure Learning in Human Sequential Decision-Making

15 years 8 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

claim paper

Read More »

221

click to vote

CORR
2010
Springer

204views Education» more CORR 2010»

Predictive State Temporal Difference Learning

15 years 5 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identiﬁcation. In practical applications...

Byron Boots, Geoffrey J. Gordon

claim paper

Read More »

151

click to vote

ETS
2000
IEEE

90views Hardware» more ETS 2000»

Role of Contracts in Enhancing Community Building in Web Courses

15 years 6 months ago

Download www.ifets.info

Project-based work via telecommunications requires the instructor and the students to take explicit steps to create an on-line community that is focused on high quality learning a...

Karen L. Murphy, Sue E. Mahoney, Tina J. Harvell

claim paper

Read More »

197

click to vote

COLT
2010
Springer

207views Machine Learning» more COLT 2010»

An Asymptotically Optimal Bandit Algorithm for Bounded Support Models

15 years 4 months ago

Download www.colt2010.org

Multiarmed bandit problem is a typical example of a dilemma between exploration and exploitation in reinforcement learning. This problem is expressed as a model of a gambler playi...

Junya Honda, Akimichi Takemura

claim paper

Read More »

« Prev « First page 44 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers