Integrating Audio and Visual Information for Modelling Communicative Behaviours Perceived as Different