Search Sciweavers | Sciweavers

1138 search results - page 59 / 228

» Feature Markov Decision Processes

124

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

15 years 3 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

127

click to vote

MCS
2010
Springer

183views Pattern Recognition» more MCS 2010»

Tomographic Considerations in Ensemble Bias/Variance Decomposition

15 years 5 months ago

Download www.ee.surrey.ac.uk

Abstract. Classiﬁer decision fusion has been shown to act in a manner analogous to the back-projection of Radon transformations when individual classiﬁer feature sets are non o...

David Windridge

claim paper

Read More »

112

click to vote

NIPS
2008

132views Information Technology» more NIPS 2008»

Bayesian Model of Behaviour in Economic Games

15 years 4 months ago

Download www.gatsby.ucl.ac.uk

Classical game theoretic approaches that make strong rationality assumptions have difficulty modeling human behaviour in economic games. We investigate the role of finite levels o...

Debajyoti Ray, Brooks King-Casas, P. Read Montague...

claim paper

Read More »

130

click to vote

PERCOM
2007
ACM

189views Computer Networks» more PERCOM 2007»

Sensor Scheduling for Optimal Observability Using Estimation Entropy

16 years 2 months ago

Download people.eng.unimelb.edu.au

We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDP). This model fits to the cases where a Markov process ...

Mohammad Rezaeian

claim paper

Read More »

119

click to vote

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

16 years 3 days ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

« Prev « First page 59 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers