What's There to Talk About? A Multi-Modal Model of Referring Behavior in the Presence of Shared Visual Information