Although image understanding and natural language processing constitute two major areas of AI, they have mostly been studied independently of each other. Only a few attempts have been concerned with the integration of computer vision and the generation of natural language expressions for the description of image sequences. The aim of our joint efforts at combining a vision system and a natural language access system is the automatic simultaneous description of dynamic imagery, i.e., we are interested in image interpretation and language processing on an incremental basis. In this contribution¹ we sketch an approach towards the integration of the Karlsruhe vision system called Actions and the natural language component Vitra developed in Saarbrücken. The steps toward realization, based

¹ The work described here was partly supported by the Sonderforschungsbereich 314 der Deutschen Forschungsgemeinschaft, “Künstliche Intelligenz und wissensbasierte Systeme”, projects V1 (IITB, Kar...
Gerd Herzog, C.-K. Sung, Elisabeth André, W