Andy Jackson's Blog

Andy has a background in computational physics and information systems architecture, including many years of experience designing algorithms for very large-scale data management and scientific computation. As a member of the Planets project, he helped define and develop a service-oriented architecture appropriate for digital preservation, and co-authored the Testbed software that builds upon it. He is now a Digital Preservation Architect and the British Library, working with the rest of the Digital Preservation Team and in collaboration with Open Planets Foundation to help define the library's approach to digital preservation. He is also the chair of the technical coordination committee for the upcoming SCAPE project.

We recently posted an article on the UK Web Archive blog that may be of interest here, User-Driven Digital Preservation, where we summarise our work with the SCAPE Project on a little prototype application that explores how we might integrate user feedback and preservation actions into our usual discovery and access processes. The idea is […]

By Andy Jackson, posted in Andy Jackson's Blog

28th Aug 2014  8:53 PM  13670 Reads  No comments

I thought OPF members might be interested in this UK Web Archive blog post I wrote on format identification and validation of our historical web archives: How much of the UK's HTML is valid?

By Andy Jackson, posted in Andy Jackson's Blog

2nd Jul 2014  12:05 PM  9969 Reads  No comments

I've started to publish some of my notes on digital preservation. It's mostly a collection of 'war stories' and summaries of some of the little experiments I've carried out over the years, but never had time to write up properly. The idea of publishing these stories is inspired in part by XFNSTN, but also by my experience […]

By Andy Jackson, posted in Andy Jackson's Blog

23rd Sep 2013  10:30 PM  13467 Reads  2 Comments

If you've not already read it, I urge you to go and read Ben Fino-Radin's piece on XFN STN, called: "IT TAKES A VILLAGE TO SAVE A HARD DRIVE". I found it both truthful and inspiring… Truthful, because the chaotic path of discovery involved in understanding mysterious digital media reflected my own experiences on similar digital […]

By Andy Jackson, posted in Andy Jackson's Blog

13th Sep 2013  9:49 PM  12011 Reads  No comments

Today, the UK Web Archive is releasing a new suite of visualisations and datasets. Amongst these is a dataset that I imagine will be of interest to this audience, as it concerns format identification and tools. I will be presenting the full results at iPres 2012, but I wanted to give you a sneak preview, […]

By Andy Jackson, posted in Andy Jackson's Blog

17th Aug 2012  9:18 AM  17093 Reads  No comments

As Paul has already noted, there are a a number of new efforts to crowdsource format information. Superficially, this might look like a duplication of effort, but I don’t think that needs to be the case. In fact I think they could fit together rather neatly, and combine to produce exactly the kind of diversity of registries […]

By Andy Jackson, posted in Andy Jackson's Blog

6th Jul 2012  9:53 PM  12770 Reads  No comments

In my previous post on formats, I ended up leaning towards a wait-and-see approach to format registry design. Unfortunately, I don’t really have that luxury. The SCAPE project needs to collect more format information to assist preservation planning and other processes. We even have some effort available to help build and/or fill a registry. But […]

By Andy Jackson, posted in Andy Jackson's Blog

11th Aug 2011  8:27 PM  15430 Reads  No comments

Bill’s earlier post and this one from Chris Rusbridge have spurred me to try to describe what I discovered about PRONOM format records during my editable registry experiment. Building that site required a close inspection of the PRONOM Format Record data model, during which I realised that we commonly conflate two quite different ways of […]

By Andy Jackson, posted in Andy Jackson's Blog

10th Aug 2011  2:19 PM  21960 Reads  No comments

This is what the first year of OPF development on the Planets Suite codebase looks like… During the first few moments, as the Planets project ends and OPF winds up, you can see the ‘explosions’ as the Testbed and then Plato are added to the GitHub codebase. Things quieten down over the winter months, with […]

By Andy Jackson, posted in Andy Jackson's Blog

24th Jun 2011  8:51 AM  12278 Reads  No comments

I’ve been keeping an eye on the #anadp11 Twitter backchannel, particularly the discussion about standards. I’m not there, so I don’t know what points have been made, but I want to try to head off a common misconception. Standards are wonderful things, but the standard itself is not enough. It’s the social consensus built by […]

By Andy Jackson, posted in Andy Jackson's Blog

24th May 2011  8:37 PM  13310 Reads  No comments