Managing the Research Data Challenge
Feb 27 : 2:00 PM - 3:00 PM
Through the Jisc-funded Filling the Digital Preservation Gap project, the Universities of York and Hull have been investigating how research data created within their institutions might be preserved using Archivematica.
This webinar will provide an insight into the project. The Universities initially investigated Archivematica to assess its potential for use in this scenario. Having concluded that it is a suitable solution, they then worked with Artefactual Systems to improve Archivematica so that it works better within a research data management infrastructure. In the final phase of the project, each institution established its own proof of concept using Archivematica for the preservation of research data. In this phase the team also further explored the challenges of identifying research data file formats in an automated fashion.
The project is now complete and both institutions are carrying out further work to move their proof of concept work into production. Meanwhile, the OPF’s Technical Lead, Carl Wilson, has been working with the Jisc team on the problem of identifying file formats found in research data. By analysing sample datasets from research institutions they’re working to establish:
- the variety of formats encountered in research datasets;
- whether current format identification tools (e.g. DROID, Tika, file) give sufficient identification coverage for research data; and
- whether alternative tools and approaches to the problem might bridge the format identification gap.
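As a rough illustration of what "identification coverage" means in the questions above, the following minimal Python sketch tallies how many sample files a signature-based identifier can name. It is not how DROID, Tika or file work internally, and it is not the project's method: the three-entry signature table and the `identify` and `coverage` helpers are hypothetical names invented for this example.

```python
from collections import Counter
from typing import Dict, Optional, Tuple

# Toy signature table: leading "magic" bytes mapped to a format label.
# Assumption for illustration only; real tools draw on far larger registries.
SIGNATURES = {
    b"%PDF-": "PDF",
    b"\x89PNG\r\n\x1a\n": "PNG",
    b"PK\x03\x04": "ZIP container",
}


def identify(data: bytes) -> Optional[str]:
    """Return a format label if the data starts with a known signature."""
    for magic, label in SIGNATURES.items():
        if data.startswith(magic):
            return label
    return None


def coverage(samples: Dict[str, bytes]) -> Tuple[Counter, float]:
    """Tally formats across samples and return the identified fraction."""
    tally = Counter(identify(data) or "unidentified" for data in samples.values())
    identified = sum(n for label, n in tally.items() if label != "unidentified")
    ratio = identified / len(samples) if samples else 0.0
    return tally, ratio


if __name__ == "__main__":
    # Hypothetical in-memory "dataset": file name mapped to its first bytes.
    dataset = {
        "paper.pdf": b"%PDF-1.7 ...",
        "figure.png": b"\x89PNG\r\n\x1a\n....",
        "instrument.dat": b"\x00\x01custom binary output",
    }
    tally, ratio = coverage(dataset)
    print(dict(tally))
    print(f"identified fraction: {ratio:.2f}")
```

Files that end up in the "unidentified" bucket, such as the instrument output here, represent the kind of gap the third question is concerned with.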
Speakers:
- Jenny Mitcham, University of York
- Carl Wilson, Open Preservation Foundation
The webinar takes place at 14:00 GMT / 15:00 CET and will last approximately one hour. There are 50 places available, which will be allocated on a first come, first served basis. The recording and slides will be available to OPF members for those who cannot attend at this time.