A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand bindin