Blogs: Tools

Blog posts filtered by the Tools subject tag.

Browse blogs by subject

All subjects Access Analysis Android apache tika ApacheTika AQuA ARC ARC to WARC archives archiving audiovisual Benchmark benchmarking best practice best practices Bit rot bitcurator board game British Library Characterisation Community compression Corpora CSV-Validator curation Database Database Archiving Database Preservation Delivery Digital Forensics digital preservation digitisation Disk Images DROID E-ARK Project EaaS Education Emulation epub Experimentation extensible Fido File Formats FLAC Flashback floppy disk floppy disks floppy drive Format Identification Format Registry GitHub Hackathon Hardware obsolescence help httpreserve Identification IDPD17 IMPACT Internet Standards iPRES. community survey isolyzer jhove job JP2 JPEG2000 jpylyzer LZW magnetic media Matchbox MediaConch Members Metadata metadate Migration Monitoring Normalisation OCR open Open Planets Foundation Open Preservation Foundation Open source OPF diary Optimization Packaging PDF PDF/A Planets policy PREFORMA PREMIS preservation Preservation Actions preservation planning Preservation Risks Preservation Strategies Preservia Process Projects PRONOM Provenance pywb recordkeeping records Representation Information Research data research infrastructure Resources RFC Rogues Gallery Rosetta Roy SCAPE Server Siegfried Signature Development Software Software benchmarking SPARQL specification spreadsheets SPRUCE standards technical technical registry testing TIFF Tika Tools training validation veraPDF Virtual Machines w3c WARC Watch WAV WAVE Web Archiving Web Publications wget Wikidata Workflow Workflows Zip

In my previous blog post I addressed the detection of broken audio files in an automated workflow for ripping audio CDs. For (data) CD-ROMs and DVDs that are imaged to an ISO image, a similar problem exists: how can we be reasonably sure that the created image is complete? In this blog post I will […]

By johan, posted in johan's Blog

13th Jan 2017  3:30 PM  10005 Reads  5 Comments

The webinar is dedicated to the practical preservation of relational database. The first part of the webinar introduces the updates which have been done to the original SIARD format in collaboration by the E-ARK project and the Swiss Federal Archives. Most notably, the SIARD 2.0 format adds additional scalability and support for newer SQL methods […]

By Becky McGuinness, posted in Becky McGuinness's Blog

9th Nov 2016  1:00 PM  0 Reads  No comments

The PREFORMA project is running a webinar series throughout September. The final webinar focuses on veraPDF. Overview The PREFORMA project’s prototyping phase finishes at the end of 2016. The veraPDF consortium will be producing a v1.0 release candidate of the software library and applications in December 2016. This webinar demonstrates the current state of veraPDF […]

By Becky McGuinness, posted in Becky McGuinness's Blog

22nd Sep 2016  2:00 PM  0 Reads  No comments

The PREFORMA project is running a webinar series throughout September. The first webinar focuses on DPF Manager. Programme: DPF Manager introduction DPF Manager Demo How to contribute Future of DPF Manager Q&A Outline: DPF Manager is a multi-platform application and a framework designed to empower end users and developers to gain full control over the […]

By Becky McGuinness, posted in Becky McGuinness's Blog

8th Sep 2016  2:00 PM  0 Reads  No comments

Jenny Mitcham, Digital Archivist at the University of York started a nice snowball rolling last week when she asked “Research data – what does it really look like?” Paul Young at the National Archives, UK, was one of those to respond, to show that perhaps the snowball had been generating momentum for a number of […]

By ross-spencer, posted in ross-spencer's Blog

14th Jun 2016  6:43 AM  3241 Reads  No comments

As promised yesterday this is the follow up blog to the refactor of my original DROID SQLite Analysis work. The new version now allows you to produce reports from the format identification tool Siegfried. In this blog I wanted to talk about a small number of other details that can be a bit harder to […]

By ross-spencer, posted in ross-spencer's Blog

24th May 2016  9:59 AM  2758 Reads  No comments

With the release of the latest Siegfried there was added motivation for me to provide an analysis output for the format identification tool. With ‘double the magic’ there was a lot more for us to explore as analysts, and fingers crossed this release (a refactor) of my SQLite based analysis tool will help with that exploration. Previous […]

By ross-spencer, posted in ross-spencer's Blog

23rd May 2016  6:56 AM  2594 Reads  No comments

The latest version of siegfried, 1.5.0, has just been released. The big change is support for a second identifier type, freedesktop.org’s Shared MIME-Info specification.

By Richard, posted in Richard's Blog

13th Mar 2016  5:42 AM  3124 Reads  No comments

For anyone dealing with a relatively small number of records, compared to say an internet or data archive, a reasonable process for ingest of material into your digital preservation system might be: 1. Process files with a file format identification tool 2. Per 1. process files with a file format validation tool 3. Per 1. […]

By ross-spencer, posted in ross-spencer's Blog

13th Mar 2016  5:27 AM  3950 Reads  No comments

This is the second blog inspired by my visit to colleagues at National Library of Australia, last August. The first, discusses a federated approach to better incorporating custom signatures into the PRONOM signature base without modifying PRONOM. The essence of the blog, however, still centers around how the community can create signatures for itself, and […]

By ross-spencer, posted in ross-spencer's Blog

7th Jan 2016  7:15 AM  4453 Reads  No comments