Using OGSA-DAI to access Taverna Provenance in myGrid

Student: TBD
Institution: TBD
Supervisor(s): TBD
Date: TBD

Abstract:

The myGrid project has developed a workflow enactment engine called Taverna. This allows biologists to create, coordinate and monitor workflows across biological systems (although Taverna's scope is not restricted solely to biological systems). For biologists, the maintenance of provenance (that is, what was done to data or how it was derived, who did it, when they did it, what the process or workflow was, and so on) is essential if anyone is to have confidence in the production, or use, of any results. Currently Taverna stores such provenance in a database using a direct JDBC connection. It might prove advantageous to store the provenance using OGSA-DAI instead. This would provide an abstraction layer over the storage resource, so that there is no explicit coupling to any particular database system. In addition, a degree of fault tolerance could be built in, so that if a primary database is unavailable a secondary one could be used to store the provenance information.

This project aims to test the above assertion. First, it will establish mechanisms that allow Taverna to store provenance data using OGSA-DAI. It will then establish what the advantages, and any disadvantages, are of employing OGSA-DAI to store provenance data for Taverna.
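
As a rough illustration of the abstraction and fault-tolerance ideas described above, the following minimal Java sketch hides the storage resource behind an interface and falls back to a secondary store when the primary is unavailable. All names here (ProvenanceStore, StoreUnavailableException, InMemoryStore, FailoverStore) are hypothetical and chosen for illustration only; they are not part of the Taverna or OGSA-DAI APIs, and the in-memory stores merely stand in for real database resources.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical abstraction over a provenance storage resource. */
interface ProvenanceStore {
    void store(String record);
}

/** Hypothetical unchecked exception signalling that a store cannot be reached. */
class StoreUnavailableException extends RuntimeException {
    StoreUnavailableException(String message) {
        super(message);
    }
}

/** In-memory stand-in for a database-backed store (assumption, for illustration). */
class InMemoryStore implements ProvenanceStore {
    private final List<String> records = new ArrayList<>();
    private boolean available;

    InMemoryStore(boolean available) {
        this.available = available;
    }

    void setAvailable(boolean available) {
        this.available = available;
    }

    List<String> records() {
        return records;
    }

    @Override
    public void store(String record) {
        if (!available) {
            throw new StoreUnavailableException("store is unavailable");
        }
        records.add(record);
    }
}

/** Tries the primary store first and falls back to the secondary on failure. */
class FailoverStore implements ProvenanceStore {
    private final ProvenanceStore primary;
    private final ProvenanceStore secondary;

    FailoverStore(ProvenanceStore primary, ProvenanceStore secondary) {
        this.primary = primary;
        this.secondary = secondary;
    }

    @Override
    public void store(String record) {
        try {
            primary.store(record);
        } catch (StoreUnavailableException e) {
            // Primary is down: write to the secondary instead.
            secondary.store(record);
        }
    }
}

class FailoverDemo {
    public static void main(String[] args) {
        InMemoryStore primary = new InMemoryStore(false);  // simulate an outage
        InMemoryStore secondary = new InMemoryStore(true);
        ProvenanceStore store = new FailoverStore(primary, secondary);

        store.store("workflow-run-1: step A completed");
        System.out.println("primary holds " + primary.records().size() + " record(s)");
        System.out.println("secondary holds " + secondary.records().size() + " record(s)");
    }
}
```

The calling code depends only on the ProvenanceStore interface, so the choice of underlying database, and whether failover is in use at all, can change without touching the caller. A real implementation would also need to decide how records written to the secondary are reconciled with the primary once it recovers, which this sketch does not attempt.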

If you would be interested in carrying out this project then please .