Dynamically structuring, updating and interrelating representations of visual and linguistic discourse context