Sciweavers

3084 search results - page 150 / 617
» Learning to Take Actions
Sort
View
COLT
2008
Springer
15 years 5 months ago
Regret Bounds for Sleeping Experts and Bandits
We study on-line decision problems where the set of actions that are available to the decision algorithm vary over time. With a few notable exceptions, such problems remained larg...
Robert D. Kleinberg, Alexandru Niculescu-Mizil, Yo...
WSPI
2008
15 years 5 months ago
Practices, Systems, and Context Working as Core Concepts in Modeling Socio-Technical Systems
This work draws on the cultural historical activity-theory and the theory of social systems to model socio-technical systems. The concepts of practice, system, and context work as ...
Heidrun Allert, Christoph Richter
CORR
2010
Springer
106views Education» more  CORR 2010»
15 years 4 months ago
MDPs with Unawareness
Markov decision processes (MDPs) are widely used for modeling decision-making problems in robotics, automated control, and economics. Traditional MDPs assume that the decision mak...
Joseph Y. Halpern, Nan Rong, Ashutosh Saxena
146
Voted
AAAI
2011
14 years 4 months ago
Differential Eligibility Vectors for Advantage Updating and Gradient Methods
In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...
Francisco S. Melo

Publication
352views
15 years 12 months ago
Efficient methods for near-optimal sequential decision making under uncertainty
This chapter discusses decision making under uncertainty. More specifically, it offers an overview of efficient Bayesian and distribution-free algorithms for making near-optimal se...
Christos Dimitrakakis