Abstract |
Spreadsheets models are frequently used by scientists to analyze research data. As the calculation workflow in these models is not made explicit, peers are not able to fully understand or assess the calculation of research results. We proposes a methodology for semi automatically deriving the calculation workflow underlying a set of spreadsheets. The starting point of our methodology is the cell dependency graph, representing all spreadsheet cells and connections. We aggregate this graph by removing removing multiple instances of the same quantities, and removing redundant calculations. Results from three case studies show that our constructed calculation models approximate the ground truth calculation work flows, both in terms of content and size, but are not a perfect match |