Andy Jackson's Blog
Andy has a background in computational physics and information systems architecture, including many years of experience designing algorithms for very large-scale data management and scientific computation. As a member of the Planets project, he helped define and develop a service-oriented architecture appropriate for digital preservation, and co-authored the Testbed software that builds upon it. He is now a Digital Preservation Architect and the British Library, working with the rest of the Digital Preservation Team and in collaboration with Open Planets Foundation to help define the library's approach to digital preservation. He is also the chair of the technical coordination committee for the upcoming SCAPE project.
We recently posted an article on the UK Web Archive blog that may be of interest here, User-Driven Digital Preservation, where we summarise our work with the SCAPE Project on a little prototype application that explores how we might integrate user feedback and preservation actions into our usual discovery and access processes. The idea is […]
I thought OPF members might be interested in this UK Web Archive blog post I wrote on format identification and validation of our historical web archives: How much of the UK's HTML is valid?
I've started to publish some of my notes on digital preservation. It's mostly a collection of 'war stories' and summaries of some of the little experiments I've carried out over the years, but never had time to write up properly. The idea of publishing these stories is inspired in part by XFNSTN, but also by my experience […]
If you've not already read it, I urge you to go and read Ben Fino-Radin's piece on XFN STN, called: "IT TAKES A VILLAGE TO SAVE A HARD DRIVE". I found it both truthful and inspiring… Truthful, because the chaotic path of discovery involved in understanding mysterious digital media reflected my own experiences on similar digital […]
Today, the UK Web Archive is releasing a new suite of visualisations and datasets. Amongst these is a dataset that I imagine will be of interest to this audience, as it concerns format identification and tools. I will be presenting the full results at iPres 2012, but I wanted to give you a sneak preview, […]
As Paul has already noted, there are a a number of new efforts to crowdsource format information. Superficially, this might look like a duplication of effort, but I don’t think that needs to be the case. In fact I think they could fit together rather neatly, and combine to produce exactly the kind of diversity of registries […]
In my previous post on formats, I ended up leaning towards a wait-and-see approach to format registry design. Unfortunately, I don’t really have that luxury. The SCAPE project needs to collect more format information to assist preservation planning and other processes. We even have some effort available to help build and/or fill a registry. But […]
Bill’s earlier post and this one from Chris Rusbridge have spurred me to try to describe what I discovered about PRONOM format records during my editable registry experiment. Building that site required a close inspection of the PRONOM Format Record data model, during which I realised that we commonly conflate two quite different ways of […]
This is what the first year of OPF development on the Planets Suite codebase looks like… During the first few moments, as the Planets project ends and OPF winds up, you can see the ‘explosions’ as the Testbed and then Plato are added to the GitHub codebase. Things quieten down over the winter months, with […]
I’ve been keeping an eye on the #anadp11 Twitter backchannel, particularly the discussion about standards. I’m not there, so I don’t know what points have been made, but I want to try to head off a common misconception. Standards are wonderful things, but the standard itself is not enough. It’s the social consensus built by […]