Sciweavers

4 search results - page 1 / 1
» Dopamine: generalization and bonuses
Sort
View
NN
2002
Springer
108views Neural Networks» more  NN 2002»
13 years 10 months ago
Dopamine: generalization and bonuses
In the temporal difference model of primate dopamine neurons, their phasic activity reports a prediction error for future reward. This model is supported by a wealth of experiment...
Sham Kakade, Peter Dayan
ATAL
2010
Springer
13 years 11 months ago
PAC-MDP learning with knowledge-based admissible models
PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...
Marek Grzes, Daniel Kudenko
FS
2006
81views more  FS 2006»
13 years 11 months ago
Utility maximization and risk minimization in life and pension insurance
We study the problem of finding optimal strategies for a life insurance company or pension fund that acts on behalf of an insured so as to maximize the expected utility (in a gene...
Peter Holm Nielsen
BIRTHDAY
2009
Springer
14 years 3 months ago
Modular Verification of Strongly Invasive Aspects
An extended specification for aspects, and a new verification method based on model checking are used to establish the correctness of strongly-invasive aspects, independently of a...
Emilia Katz, Shmuel Katz