Blogs

Our blogs are written by contributors from the international digital preservation community. You can find information on a wide range of topics covering tools, project news, case studies and best practice. Everyone is welcome to post a blog and join in the discussion.

Sign in or sign up for an account to get started.

Browse blogs by subject

Earlier this week the National Archives of the Netherlands (NANeth) published a report on preferred file formats. It gives an overview of NANeth’s ‘preferred’ and ‘acceptable’ formats for 9 content categories, and also explains the reasoning behind the selected formats. Even though in Dutch language only, the report is well worth a look. However, I […]

By johan, posted in johan's Blog

9th Dec 2016  3:41 PM  94 Reads  No comments

Flashback is a proof of concept project run by the British Library’s Digital Preservation Team. The project is examining specific emulation and migration solutions as methods for preserving digital content held in the Library’s stock on 3.5” and 5.25” disks and on CD and DVDs.  These could be software items but are mainly supplements to […]

By Simon Whibley, posted in Simon Whibley's Blog

2nd Dec 2016  1:54 PM  254 Reads  No comments

Last week I attended the PREFORMA Experience Workshop in Berlin. The Open Preservation Foundation and PDF Association are leading veraPDF. The morning focused on use cases for conformance checkers from memory institutions and the afternoon explored the PREFORMA challenge with an overview of the testing phase which starts in January 2017. This was followed by […]

By Becky, posted in Becky's Blog

2nd Dec 2016  7:31 AM  124 Reads  No comments

In this Blogpost I want to examine the findings of two validation tools, JHOVE (Version 1.14.6) and Bad Peggy (version 2.0), which scans image files for damages, using the Java Image IO library. Goal is to compare the findings and enable the reader to know what to expect from these validation tools for the daily […]

By Yvonne Tunnat, posted in Yvonne Tunnat's Blog

29th Nov 2016  10:56 AM  386 Reads  No comments

On 11th October we held our first JHOVE online hack day. Our aim was to catalogue error messages produced by JHOVE to get a better understanding of their meaning and potential preservation impact. Background: organising an online hack day We have been considering running online hackathons because attending face-to-face events has become more difficult as […]

By Becky, posted in Becky's Blog

19th Oct 2016  10:06 AM  478 Reads  No comments

BACKGROUND Nearly two and a half years ago, I started an effort for Apache Tika™ to help improve its robustness via TIKA-1302.  Apache Tika™ is an umbrella/wrapper project that “detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).” I documented some of the early work […]

By tallison, posted in tallison's Blog

4th Oct 2016  3:03 PM  949 Reads  No comments

This is a relatively long post, so to summarise before delving into the details: We’re exploring Wikidata, the (relatively new) Wikipedia for data, as a knowledge base for digital preservation information and would appreciate feedback and involvement. At Yale University Library we are beginning a new programme of work (with funding from both CLIR and […]

By Euan Cochrane, posted in Euan Cochrane's Blog

30th Sep 2016  9:47 PM  2698 Reads  7 Comments

Finally, we have confirmation for the original hardware, an HP9845 with an external floppy drive, the HP9895A Disk Drive attached to it. The disks have 77 tracks, or cylinders, and contain 30 sectors with an interleaving factor of 7. The machine was marketed for scientific purposes and was widely used in the university’s medical centre […]

By Denise de Vries, posted in Denise de Vries's Blog

29th Sep 2016  2:17 PM  408 Reads  No comments

Becoming more confident in the low-level encoding, it is time to look at the reproduction of actual bit streams. Our current knowledge, together with the directory listing, strongly suggests an HP system of the 1980s.  The call for input brought up some more information on possible disk formats. A colleague (in another city) who worked […]

By Denise de Vries, posted in Denise de Vries's Blog

28th Sep 2016  5:37 PM  297 Reads  No comments

The experiments and floppy disk imaging in Berlin left us pretty optimistic, that our hardware setup for the recovery task, especially the Mitsubishi floppy drive is working properly. We got a stream image of every disk that we can analyze further. Of course we hoped that we would get results by just running the Kryoflux […]

By Dirk von Suchodoletz, posted in Dirk von Suchodoletz's Blog

26th Sep 2016  9:27 AM  364 Reads  No comments