The ease with which one can copy and transform data on the Web, has made it increasingly di cult to determine the origins of a piece of data. We use the term data provenance to refer to the process of tracing and recording the origins of data and its movement between databases. Provenance is now an acute issue in scienti c databases where it central to the validation of data. In this paper we discuss some of the technical issues that have emerged in an initial exploration of the topic.