With the rise of XML, the database community has been challenged by semi-structured data processing. Since the data type behind XML is the tree, state-of-the-art RDBMSs have learn...
An e-lesson is comprised of a "body" and a "view". The body is the actual content of the e-lesson and the assumption is that it is an html document. The view i...
This work addresses the soundtrack indexing of multimedia documents. We present and merge two audio classification tools that we have developed. The first one, a speech music clas...
This paper presents a novel algorithm for document clustering based on a combinatorial framework of the Principal Direction Divisive Partitioning (PDDP) algorithm [1] and a simpli...
We consider fast two-sided error-tolerant search that is robust against errors both on the query side (type alogrithm, find documents with algorithm) as well as on the document si...