Blogs: Preservation Risks

Blog posts filtered by the Preservation Risks subject tag.

Browse blogs by subject

Some four years ago I wrote a blog post that demonstrated how Apache Preflight (the PDF/A validator tool that is part of Apache PDFBox) can be used to detect features in a PDF that are potential preservation risks. A follow-up blog applied Schematron rules to the Preflight output in an attempt at doing policy-based assessments. […]

By johan, posted in johan's Blog

1st Jun 2017  1:53 PM  758 Reads  No comments

Join us to help improve JHOVE, an open source, identification, characterisation, and validation tool widely-used by the digital preservation community. Building on the fantastic community collaboration from our first online JHOVE Hack Day, we are happy to announce that registration is open for our second JHOVE Hack Day at: https://jhoveonlinehackday-spring2017.eventbrite.co.uk. During our second online hack day […]

By Becky, posted in Becky's Blog

27th Apr 2017  8:00 AM  0 Reads  No comments

Many factors contribute to the long-term preservation of and access to digital collections. And typically, the endpoint for this material is a repository—or other type of preservation system. But what happens to content after it is stored? How do digital preservationists ensure that content is correct and valid when ingested as well as remains unchanged […]

By caylinsmith, posted in caylinsmith's Blog

23rd Jan 2017  12:57 PM  1845 Reads  1 Comment

In my previous blog post I addressed the detection of broken audio files in an automated workflow for ripping audio CDs. For (data) CD-ROMs and DVDs that are imaged to an ISO image, a similar problem exists: how can we be reasonably sure that the created image is complete? In this blog post I will […]

By johan, posted in johan's Blog

13th Jan 2017  3:30 PM  5871 Reads  5 Comments

While browsing ArchiveTeam's File Formats Wiki earlier this week, I came across some entries I created there on Quattro Pro spreadsheets two years ago. At the time I had also contributed some old Quattro Pro for DOS spreadsheets (here and here) from my personal archives to the OPF format corpus. Seeing those files again, I […]

By johan, posted in johan's Blog

29th Oct 2014  2:59 PM  20365 Reads  2 Comments

We’ve been doing legacy disk extracts at Archives New Zealand for a number of years with much of the effort enabling us to do this work being done by colleague Mick Crouch, and former Archives New Zealand colleague Euan Cochrane – earlier this year, we received some disks from New Zealand’s Department of Conservation (DoC) which we successfully imaged and […]

By ross-spencer, posted in ross-spencer's Blog

23rd Sep 2014  8:14 AM  13090 Reads  4 Comments

Over the last three and a half years, the SCAPE project worked in several directions in order to propose new solutions for digital preservation, as well as improving existing ones. One of the results of this work is the SCAPE preservation environment (SPE). It is a loosely coupled system, which enables extending existing digital repository […]

By jmaferreira, posted in jmaferreira's Blog

19th Sep 2014  1:51 PM  10913 Reads  No comments

I would like to draw your attention to the new QA tool for finger detection on scans: https://github.com/openplanets/finger-detection-tool. This tool was developed by AIT in scope of the SCAPE project.   Checking to identify fingers on scan manually is a very time-consuming and error-prone process. You need a tool to help you: Fingerdet. Fingerdet is […]

By Roman Graf, posted in Roman Graf's Blog

10th Jul 2014  11:49 AM  10157 Reads  No comments

Hi, this is my first blog post in which I want to introduce the project I am currently working on: Flint. history Flint (File/Format Lint) has developed out of DRMLint, a lightweight piece of Java software that makes use of different third party tools (Preflight, iText, Calibre, Jhove) to detect DRM in PDF-files and EPUBs. […]

By alecs, posted in alecs's Blog

2nd Jul 2014  12:53 PM  11282 Reads  No comments

The problem We have a large volume of content on floppy disks that we know are degrading but which we don't know the value of. Considerations We don't want to waste time/resources on low-value content. We don't know the value of the content. We want to be able to back up the content on the […]

By Euan Cochrane, posted in Euan Cochrane's Blog

26th Jun 2014  3:15 PM  12816 Reads  1 Comment