johan's Blog

Digital Preservation Researcher at KB / National Library of the Netherlands

Over the last few years, the EPUB format has gained widespread popularity in the consumer market. The KB has been approached by a number of publishers that wish to use EPUB for delivering some of their electronic publications. Surprisingly little information is available on the format's suitability for archival preservation, apart from Library of Congress' […]

By johan, posted in johan's Blog

18th Jun 2012  12:48 PM  20396 Reads  4 Comments

In this blog post I will give a brief update on the latest jpylyzer developments. Jpylyzer is a validation and feature extraction tool for the JP2 (JPEG 2000 Part 1) still image format. History of jpylyzer: Around mid-summer 2011, the KB started initial preparations for migrating 146 TB of TIFF images from the Dutch Metamorfoze […]

By johan, posted in johan's Blog

23rd Apr 2012  10:53 AM  19762 Reads  No comments

This will be my shortest blog post ever. Following up on my previous blog post on a prototype JP2 validator and properties extractor (jpylyzer), there is now a comprehensive User Manual for the tool. Just follow the link below:
https://github.com/openplanets/jpylyzer/blob/master/doc/jpylyzerUserManual.pdf?raw=true
Link to jpylyzer home page: http://openplanets.github.io/jpylyzer/
Meanwhile, work on jpylyzer remains ongoing, so watch this […]

By johan, posted in johan's Blog

10th Jan 2012  5:07 PM  17292 Reads  No comments

A few months ago I wrote a blog post on a simple JP2 file structure checker. This led to some interesting online discussions on JP2 validation. Some people asked me about the feasibility of expanding the tool to a full-fledged JP2 validator. Despite some initial reservations, I eventually decided to dedicate a couple of weeks […]

By johan, posted in johan's Blog

14th Dec 2011  12:30 PM  23914 Reads  No comments

As I already briefly mentioned in a previous blog post, one of the objectives of the SCAPE project is to develop an architecture that will enable large-scale characterisation of digital file objects. As a first step, we are evaluating existing characterisation tools. The overall aim of this work is twofold. First, we want to […]

By johan, posted in johan's Blog

21st Sep 2011  1:40 PM  19565 Reads  No comments

Over the last few weeks I’ve been working on the design of a workflow that the KB is planning to use for the migration of a collection of (mostly old) TIFF images to JP2. One major risk of such a migration is that hardware failures during the migration process may result in corrupted images. For […]

By johan, posted in johan's Blog

1st Sep 2011  8:47 AM  36305 Reads  No comments

As a part of the SCAPE project, I’m currently heavily involved in the evaluation of various file format identification tools. The overall aim of this work is to determine which tools are suitable candidates for inclusion in the SCAPE architecture. In addition, we’re also trying to get a better idea of each tool’s specific strengths […]

By johan, posted in johan's Blog

11th Jul 2011  4:49 PM  21593 Reads  No comments

The JPEG 2000 compression standard is becoming steadily more popular in the archival community. Several large (national) libraries are now using the JP2 format (which corresponds to Part 1 of the standard) as the master format in mass digitisation projects. However, some aspects of the JP2 file format are defined in ways that […]

By johan, posted in johan's Blog

6th Jun 2011  10:11 AM  21334 Reads  3 Comments