Blogs: Identification

Blog posts filtered by the Identification subject tag.

Inspired by Jenny Mitcham’s blog post about developing her first file format signature, I thought it would be fun to take a crack at creating one myself. I previously dipped my toe into the world of contributing to PRONOM by looking at a few mis-identifications and multi-identifications, but I had yet to create a file […]

By Andrea Byrne, posted in Andrea Byrne's Blog

8th Sep 2016  9:14 AM  1153 Reads  4 Comments

Jenny Mitcham, Digital Archivist at the University of York, started a nice snowball rolling last week when she asked “Research data – what does it really look like?” Paul Young at the National Archives, UK, was one of those to respond, to show that perhaps the snowball had been generating momentum for a number of […]

By ross-spencer, posted in ross-spencer's Blog

14th Jun 2016  6:43 AM  1356 Reads  No comments

As promised yesterday, this is the follow-up blog to the refactor of my original DROID SQLite Analysis work. The new version now allows you to produce reports from the format identification tool Siegfried. In this blog I wanted to talk about a small number of other details that can be a bit harder to […]

By ross-spencer, posted in ross-spencer's Blog

24th May 2016  9:59 AM  1081 Reads  No comments

With the release of the latest Siegfried there was added motivation for me to provide an analysis output for the format identification tool. With ‘double the magic’ there was a lot more for us to explore as analysts, and fingers crossed this release (a refactor) of my SQLite-based analysis tool will help with that exploration. Previous […]

By ross-spencer, posted in ross-spencer's Blog

23rd May 2016  6:56 AM  1010 Reads  No comments

FIDO (Format Identification for Digital Objects) is a command-line tool maintained by the OPF and used for identification of digital files based on their PRONOM signature. This webinar will provide an update on the FIDO project by outlining recent improvements to the tool. We will also discuss: Why is format identification so important? What are […]

By Becky, posted in Becky's Blog

10th May 2016  4:00 PM  0 Reads  No comments

The latest version of siegfried, 1.5.0, has just been released. The big change is support for a second identifier type, freedesktop.org’s Shared MIME-Info specification.

By Richard, posted in Richard's Blog

13th Mar 2016  5:42 AM  1497 Reads  No comments

For anyone dealing with a relatively small number of records, compared to say an internet or data archive, a reasonable process for ingest of material into your digital preservation system might be: 1. Process files with a file format identification tool 2. Per 1. process files with a file format validation tool 3. Per 1. […]

By ross-spencer, posted in ross-spencer's Blog

13th Mar 2016  5:27 AM  1641 Reads  No comments

At my workplace, we write a lot of small scripts to encode preservation workflows. These scripts pipeline simple actions like munging metadata, moving files about, and calling other tools such as Tika and ImageMagick. Often these actions are conditional on the format of the file being processed: for example, we only want to run Tika over the formats for […]

By Richard, posted in Richard's Blog

18th Feb 2016  3:20 AM  1472 Reads  No comments

This is the second blog inspired by my visit to colleagues at the National Library of Australia last August. The first discusses a federated approach to better incorporating custom signatures into the PRONOM signature base without modifying PRONOM. The essence of the blog, however, still centers around how the community can create signatures for itself, and […]

By ross-spencer, posted in ross-spencer's Blog

7th Jan 2016  7:15 AM  1783 Reads  No comments

Presented here is a tool that will create a 'rogues gallery' out of any digital collection for which you have a DROID report (or, soon, a Siegfried report). The tool was presented at a recent OPF Webinar, Preservation in Practice: Archives New Zealand; slides here. It was created by myself and Andrea K. […]

By ross-spencer, posted in ross-spencer's Blog

25th Aug 2015  9:44 AM  2352 Reads  No comments