Description

Title Situating the problem of reconstructing provenance
Abstract As a new PhD student, I will start the presentation by introducing myself and my background. The rest of the presentation will be about my current research topic: reconstructing provenance. The provenance of a data item is the metadata describing how, when and by whom the data item was produced. Provenance is crucial for assessing authenticity, enabling trust and allowing reproducibility. Most current provenance management systems assume that provenance is fully tracked, for example by appropriately instrumenting the software in use. In practice, this rarely happens, which leads to incomplete or missing provenance trails. The aim of my current and future work is to research approaches to reconstructing provenance. As a starting point, I am considering the reconstruction of dependency chains between documents in a shared folder. For example, how can we detect which document was created using information contained in another document? In this presentation, I will introduce the problem and give an overview of related work.