Sciweavers

499 search results for "Model Minimization in Markov Decision Processes" (page 64 of 100)
IJCAI
2007
The Value of Observation for Monitoring Dynamic Systems
We consider the fundamental problem of monitoring (i.e. tracking) the belief state in a dynamic system, when the model is only approximately correct and when the initial belief st...
Eyal Even-Dar, Sham M. Kakade, Yishay Mansour
CVPR
2008
IEEE
The statistical modelling of fingerprint minutiae distribution with implications for fingerprint individuality studies
The spatial distribution of fingerprint minutiae is a core problem in fingerprint individuality studies, the cornerstone of fingerprint authentication technology. Previously...
Jiansheng Chen, Yiu Sang Moon
ICASSP
2010
IEEE
Automatic state discovery for unstructured audio scene classification
In this paper we present a novel scheme for unstructured audio scene classification that possesses three highly desirable and powerful features: autonomy, scalability, and robust...
Julian Ramos, Sajid M. Siddiqi, Artur Dubrawski, G...
PKDD
2010
Springer
Smarter Sampling in Model-Based Bayesian Reinforcement Learning
Abstract. Bayesian reinforcement learning (RL) is aimed at making more efficient use of data samples, but typically uses significantly more computation. For discrete Markov Decis...
Pablo Samuel Castro, Doina Precup
ICML
2006
IEEE
Using inaccurate models in reinforcement learning
In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...
Pieter Abbeel, Morgan Quigley, Andrew Y. Ng