In this paper we address the issue of structural multimedia similarity, which is based on the relations between the individual objects that comprise a multimedia document. We propose a binary string encoding for 1D relations which permits the automatic derivation of similarity measures. We then extend it to various resolution levels and many dimensions and show that reasoning on spatiotemporal structure is significantly facilitated in the new framework, by applying it to multimedia presentation and motion similarity. Keywords Multimedia Similarity, Similarity Queries, Spatiotemporal Relations