Blogs: Migration

Blog posts filtered by the Migration subject tag.

Two weeks ago our archive interest group (AIG) held its first face-to-face meeting. We were kindly hosted by the Danish National Archives in Copenhagen. Currently the AIG comprises participants from our three national archive members in Denmark (Rigsarkivet), the Netherlands (Nationaal Archief), and Estonia (Rahvusarhiiv); new members are welcome. The group has been […]

By Becky, posted in Becky's Blog

24th Feb 2017  9:45 AM  1973 Reads  No comments

I read Johan's fascinating post this morning about the obsolescence of Quattro Pro formats. In the post he included a set of Quattro Pro spreadsheets and invited others to try to access them using the original software or an old version of Excel. Luckily I have an old version of Quattro Pro available so I thought I'd […]

By Euan Cochrane, posted in Euan Cochrane's Blog

29th Oct 2014  6:24 PM  16192 Reads  1 Comment

While browsing ArchiveTeam's File Formats Wiki earlier this week, I came across some entries I created there on Quattro Pro spreadsheets two years ago. At the time I had also contributed some old Quattro Pro for DOS spreadsheets (here and here) from my personal archives to the OPF format corpus. Seeing those files again, I […]

By johan, posted in johan's Blog

29th Oct 2014  2:59 PM  23522 Reads  2 Comments

Some time ago Will Palmer, Peter May and Peter Cliff of the British Library published a really interesting paper that investigated three different JPEG 2000 codecs, and their effects on image quality in response to lossy compression. Most remarkably, their analysis revealed differences not only in the way these codecs encode (compress) an image, but […]

By johan, posted in johan's Blog

26th Sep 2014  1:06 PM  16915 Reads  3 Comments

We’ve been doing legacy disk extracts at Archives New Zealand for a number of years, with much of the effort that enables this work being done by colleague Mick Crouch and former Archives New Zealand colleague Euan Cochrane. Earlier this year, we received some disks from New Zealand’s Department of Conservation (DoC) which we successfully imaged and […]

By ross-spencer, posted in ross-spencer's Blog

23rd Sep 2014  8:14 AM  14033 Reads  4 Comments

It is well-known that PDF documents can contain features that are preservation risks (e.g. see here and here). Migration of existing PDFs to PDF/A is sometimes advocated as a strategy for mitigating these risks. However, the benefits of this approach are often questionable, and the migration process can also be quite risky in itself. As […]

By johan, posted in johan's Blog

27th Aug 2014  3:47 PM  18558 Reads  9 Comments

This blog post continues a series of posts about the web archiving topic "ARC to WARC migration"; it is a follow-up to the posts "ARC to WARC migration: How to deal with de-duplicated records?" and "Some reflections on scalable ARC to WARC migration". Especially the last one of these posts, which described how SCAPE […]

By shsdev, posted in shsdev's Blog

10th Jul 2014  10:44 AM  12131 Reads  No comments

During my time at The National Archives UK, my colleague Adam Retter developed a methodology for the reversible pre-conditioning of complex binary objects. The technique was required to avoid the doubling of storage for malformed JPEG2000 objects numbering in the hundreds of thousands. The difference between a malformed JPEG2000 file and a corrected, well-formed JPEG2000 file, in […]

By ross-spencer, posted in ross-spencer's Blog

9th Jul 2014  12:31 AM  13257 Reads  1 Comment

Authors: Martin Schaller, Sven Schlarb, and Kristin Dill. In the SCAPE Project, the memory institutions are working on practical application scenarios for the tools and solutions developed within the project. One of these application scenarios is the migration of a large image collection from one format to another. There are many reasons why such a […]

By shsdev, posted in shsdev's Blog

24th Jun 2014  9:12 AM  12055 Reads  No comments

In my last blog post about ARC to WARC migration I did a performance comparison of two alternative approaches for migrating very large sets of ARC container files to the WARC format using Apache Hadoop, and I said that resolving contextual dependencies in order to create self-contained WARC files was the next point to investigate […]

By shsdev, posted in shsdev's Blog

24th Mar 2014  4:13 PM  15297 Reads  No comments