davetaz's Blog

For the last 10 years the digital preservation community has painstakingly attempted to gather together information on digital file formats. This has led to the development of many registries, databases and silos of information. These efforts have been a great success in many communities but the number of people involved is still too few to […]

By davetaz, posted in davetaz's Blog

1st Nov 2012  12:32 PM  11630 Reads  No comments

While open data sources, such as PRONOM, Software Conversion Registry (CSR) and govdocs are excellent examples example of publishing re-usable data (to some extent) there is still a big problem with gaining access to other sources of data. This is mainly due to projects and organisations not focussing on re-usability of data, rather just their […]

By davetaz, posted in davetaz's Blog

29th Aug 2012  1:47 PM  13749 Reads  No comments

As part of the evaluation framework i'm developing for OPF and Scape I've been working on gathering a corpora of files to run experiments against.  Although Govdocs1 would seem like a good place to start there are a few problems: 1) It's too big, 1 Million Files is just showing off. 2) It's full of […]

By davetaz, posted in davetaz's Blog

26th Jul 2012  11:31 AM  23620 Reads  9 Comments

Currently the scientific and R&D communities are continuously talking about data and dataset collection and reuse. Core to these aspects is archiving and preserving this content.  Sharing data is key for many reasons: Discoving new science Re-producing results Verifying research Evidence based decision making Establishing trust   The projects related with OPF are no exception […]

By davetaz, posted in davetaz's Blog

3rd Jul 2012  3:27 PM  10893 Reads  No comments

In order to make the genreation of debian pacakges easy, OPF has created and paying to host a number of Amazon AMIs which can be lauched by anyone. These AMIs are already set up to build the package automatically and their only function is to download the latest release (by tag number), build it and put it […]

By davetaz, posted in davetaz's Blog

8th Mar 2012  2:13 PM  20168 Reads  No comments

  Since joining the project in July 2011 I have focussed on aligning a number of different groups and outputs to be consistant and maintainable into the future. In this way I feel my role is not only to support OPF but to use it as a platform to support the on going digital preservation […]

By davetaz, posted in davetaz's Blog

20th Feb 2012  2:08 PM  21563 Reads  3 Comments