Building a better probabilistic model of images by factorization

13 years 4 months ago

Download redwood.berkeley.edu

We describe a directed bilinear model that learns higherorder groupings among features of natural images. The model represents images in terms of two sets of latent variables: one set of variables represents which feature groups are active, while the other speciﬁes the relative activity within groups. Such a factorized representation is beneﬁcial because it is stable in response to small variations in the placement of features while still preserving information about relative spatial relationships. When trained on MNIST digits, the resulting representation provides state of the art performance in classiﬁcation using a simple classiﬁer. When trained on natural images, the model learns to group features according to proximity in position, orientation, and scale. The model achieves high log-likelihood (-94 nats), surpassing the current state of the art for natural images achievable with an mcRBM model. 1

Jack Culpepper, Jascha Sohl-Dickstein, Bruno Olaha

Real-time Traffic