Sciweavers

84 search results - page 11 / 17
» Finding and transferring policies using stored behaviors
Sort
View
ICML
2006
IEEE
14 years 8 months ago
Qualitative reinforcement learning
When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...
Arkady Epshteyn, Gerald DeJong
IEPOL
2002
64views more  IEPOL 2002»
13 years 7 months ago
Is there 'Hope' for price cap regulation?
This article examines the problem of regulatory opportunism or strategic behavior by the regulator that undermines the performance of price cap regulation (PCR) without technicall...
Dennis L. Weisman
ANOR
2008
66views more  ANOR 2008»
13 years 7 months ago
Optimization problems involving collections of dependent objects
We describe a class of problems motivated by numerous real-world applications where there is a collection of objects that have both a cost and a value, but where some of those obj...
David L. Roberts, Charles L. Isbell, Michael L. Li...
EUROCAST
2007
Springer
182views Hardware» more  EUROCAST 2007»
14 years 1 months ago
A k-NN Based Perception Scheme for Reinforcement Learning
Abstract a paradigm of modern Machine Learning (ML) which uses rewards and punishments to guide the learning process. One of the central ideas of RL is learning by “direct-online...
José Antonio Martin H., Javier de Lope Asia...
CCS
2010
ACM
13 years 7 months ago
An empirical study of privacy-violating information flows in JavaScript web applications
The dynamic nature of JavaScript web applications has given rise to the possibility of privacy violating information flows. We present an empirical study of the prevalence of such...
Dongseok Jang, Ranjit Jhala, Sorin Lerner, Hovav S...