Blogs: Representation Information

Blog posts filtered by the Representation Information subject tag.

Browse blogs by subject

All subjects Access Analysis Android apache tika ApacheTika AQuA ARC ARC to WARC archives archiving audiovisual Benchmark benchmarking best practice best practices Bit rot bitcurator board game British Library Characterisation Community compression Corpora curation Database Database Archiving Database Preservation Delivery Digital Forensics digital preservation digitisation Disk Images DROID E-ARK Project EaaS Education Emulation epub Experimentation extensible Fido File Formats FLAC Flashback floppy disk floppy disks floppy drive Format Identification Format Registry GitHub Hackathon Hardware obsolescence help httpreserve Identification IDPD17 IMPACT Internet Standards iPRES. community survey isolyzer jhove job JP2 JPEG2000 jpylyzer LZW magnetic media Matchbox MediaConch Members Metadata metadate Migration Monitoring Normalisation OCR open Open Planets Foundation Open Preservation Foundation Open source OPF diary Optimization Packaging PDF PDF/A Planets policy PREFORMA PREMIS preservation Preservation Actions preservation planning Preservation Risks Preservation Strategies Preservia Process Projects PRONOM Provenance pywb recordkeeping records Representation Information Research data research infrastructure Resources RFC Rogues Gallery Rosetta Roy SCAPE Server Siegfried Signature Development Software Software benchmarking SPARQL specification spreadsheets SPRUCE standards technical technical registry testing TIFF Tika Tools training validation veraPDF Virtual Machines w3c WARC Watch WAV WAVE Web Archiving Web Publications wget Wikidata Workflow Workflows Zip

Last year (2012) the KB released a report on the suitability of the EPUB format for archival preservation. A substantial number of EPUB-related developments have happened since then, and as a result some of the report's findings and conclusions have become outdated. This applies in particular to the observations on EPUB 3, and the support […]

By johan, posted in johan's Blog

23rd May 2013  2:23 PM  17469 Reads  No comments

“Characterization” can mean many things (I’m particularly fond, especially in this context, of the OED’s “creation of a fictitious character or fictitious characters”). Back in October Paul Wheatley suggested that digital preservation practitioners needed “better characterisation” and defined this as enabling them to determine the condition, content and value of digital records prior to ingest […]

By pixelatedpete, posted in pixelatedpete's Blog

15th Mar 2013  12:23 PM  15237 Reads  1 Comment

The most important new feature of the recently released PDF/A-3 standard is that, unlike PDF/A-2 and PDF/A-1, it allows you to embed any file you like. Whether this is a good thing or not is the subject of some heated on-line discussions. But what do we actually mean by embedded files? As it turns out, […]

By johan, posted in johan's Blog

9th Jan 2013  1:42 PM  128772 Reads  16 Comments

The PDF format contains various features that may make it difficult to access content that is stored in this format in the long term. Examples include (but are not limited to): Encryption features, which may either restrict some functionality (copying, printing) or make files inaccessible altogether. Multimedia features (embedded multimedia objects may be subject to […]

By johan, posted in johan's Blog

19th Dec 2012  3:15 PM  16405 Reads  1 Comment

As many of you may know, Cal Lee, Andi Rauber and myself recently attempted to facilitate a broad discussion on emerging research challenges within the DP community at a workshop at IPRES 2012. We solicited – and received (thanks again to all contributors!) – wide-ranged contributions from Europe, North America, and New Zealand. The invitation […]

By cbecker, posted in cbecker's Blog

13th Nov 2012  8:08 AM  13231 Reads  No comments

Coming up in the next month are two excellent OPF hackathons: the “Emulation, learn from the experts” hackathon and the “CURATEcamp 24 hour worldwide file id hackathon“. One follows the other with the emulation hackathon running from the 13th of November till the 15th and the File ID hackathon on Friday the 16th.  This seems […]

By Euan Cochrane, posted in Euan Cochrane's Blog

29th Oct 2012  2:29 AM  12364 Reads  No comments

I've already written a number of blog posts on format validation of JP2 files. Format validation is only a one aspect of a quality assessment workflow. Digitisation guidelines typically impose various constraints on the technical characteristics of preservation and access images. For example, they may state that a preservation master must be losslessly compressed, and […]

By johan, posted in johan's Blog

4th Sep 2012  11:04 AM  16563 Reads  2 Comments

The purpose of this post is to give a brief introduction to creating, editing and submitting format signatures (or ‘magic‘ entries) for the well-known File tool. The occasion for this was some work I did last week on improving File‘s identification of the JPEG 2000 formats. I had some difficulty finding any easy-to-follow documentation that […]

By johan, posted in johan's Blog

9th Aug 2012  11:53 AM  32745 Reads  1 Comment

In this blog post I'll be dusting off some old stuff for a change. The occasion for this is the following question, posted by Paul Wheatley on the Libraries and Information Science Stack Exchange website a few days ago: What preservation risks are associated with the PDF file format? This reminded me of a report […]

By johan, posted in johan's Blog

26th Jul 2012  9:48 AM  17873 Reads  3 Comments

Over the last few years, the EPUB format has gained widespread popularity in the consumer market. The KB has been approached by a number of publishers that wish to use EPUB for delivering some of their electronic publications. Surprisingly little information is available on the format's suitability for archival preservation, apart from Library of Congress' […]

By johan, posted in johan's Blog

18th Jun 2012  12:48 PM  20391 Reads  4 Comments