Blogs

Our blogs are written by contributors from the international digital preservation community. You can find information on a wide range of topics covering tools, project news, case studies and best practice. Everyone is welcome to post a blog and join in the discussion.

In my previous blog post I addressed the detection of broken audio files in an automated workflow for ripping audio CDs. For (data) CD-ROMs and DVDs that are imaged to an ISO image, a similar problem exists: how can we be reasonably sure that the created image is complete? In this blog post I will […]

By johan, posted in johan's Blog

13th Jan 2017  3:30 PM  4409 Reads  5 Comments

At the KB we have a large collection of offline optical media. Most of these are CD-ROMs, but we also have a sizeable proportion of audio CDs. We’re currently in the process of designing a workflow for stabilising the contents of these materials using disk imaging. For audio CDs this involves ‘ripping’ the tracks to […]

By johan, posted in johan's Blog

4th Jan 2017  2:38 PM  1134 Reads  3 Comments

As you might know, the PREFORMA project works on one of the main challenges memory institutions are facing nowadays: the long-term preservation of digital data. If your memory institution is working with or is helping to refine open source tools, please share your experiences via this survey. It should take about 20 minutes to complete, and […]

By MelanieImming, posted in MelanieImming's Blog

13th Dec 2016  12:38 PM  754 Reads  No comments

Earlier this week the National Archives of the Netherlands (NANeth) published a report on preferred file formats. It gives an overview of NANeth’s ‘preferred’ and ‘acceptable’ formats for 9 content categories, and also explains the reasoning behind the selected formats. Even though it is available only in Dutch, the report is well worth a look. However, I […]

By johan, posted in johan's Blog

9th Dec 2016  3:41 PM  1827 Reads  1 Comment

Flashback is a proof of concept project run by the British Library’s Digital Preservation Team. The project is examining specific emulation and migration solutions as methods for preserving digital content held in the Library’s stock on 3.5” and 5.25” disks and on CDs and DVDs. These could be software items but are mainly supplements to […]

By Simon Whibley, posted in Simon Whibley's Blog

2nd Dec 2016  1:54 PM  950 Reads  No comments

Last week I attended the PREFORMA Experience Workshop in Berlin. The Open Preservation Foundation and PDF Association are leading veraPDF. The morning focused on use cases for conformance checkers from memory institutions and the afternoon explored the PREFORMA challenge with an overview of the testing phase which starts in January 2017. This was followed by […]

By Becky, posted in Becky's Blog

2nd Dec 2016  7:31 AM  775 Reads  No comments

In this blog post I want to examine the findings of two validation tools: JHOVE (version 1.14.6) and Bad Peggy (version 2.0), which scans image files for damage using the Java Image IO library. The goal is to compare the findings and enable the reader to know what to expect from these validation tools for the daily […]

By Yvonne Tunnat, posted in Yvonne Tunnat's Blog

29th Nov 2016  10:56 AM  1770 Reads  1 Comment

On 11th October we held our first JHOVE online hack day. Our aim was to catalogue error messages produced by JHOVE to get a better understanding of their meaning and potential preservation impact. Background: organising an online hack day. We have been considering running online hackathons because attending face-to-face events has become more difficult as […]

By Becky, posted in Becky's Blog

19th Oct 2016  10:06 AM  1082 Reads  No comments

BACKGROUND: Nearly two and a half years ago, I started an effort for Apache Tika™ to help improve its robustness via TIKA-1302. Apache Tika™ is an umbrella/wrapper project that “detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).” I documented some of the early work […]

By tallison, posted in tallison's Blog

4th Oct 2016  3:03 PM  1592 Reads  No comments

This is a relatively long post, so to summarise before delving into the details: We’re exploring Wikidata, the (relatively new) Wikipedia for data, as a knowledge base for digital preservation information and would appreciate feedback and involvement. At Yale University Library we are beginning a new programme of work (with funding from both CLIR and […]

By Euan Cochrane, posted in Euan Cochrane's Blog

30th Sep 2016  9:47 PM  4500 Reads  7 Comments