On the storage, management and analysis of (multi) similarity for large scale protein structure datasets in the grid