Blogs: PDF

Blog posts filtered by the PDF subject tag.

Browse blogs by subject

The PDF Association is pleased to announce its first event in the southern hemisphere! With special focus on archiving and accessibility, PDF Day Australia is a unique opportunity for those in the Asia / Pacific region to engage with the world’s leading PDF technology experts on a broad range of subjects. Comprising 17 sessions in […]

By Becky, posted in Becky's Blog

25th Nov 2016  12:00 AM  0 Reads  No comments

On 11th October we held our first JHOVE online hack day. Our aim was to catalogue error messages produced by JHOVE to get a better understanding of their meaning and potential preservation impact. Background: organising an online hack day We have been considering running online hackathons because attending face-to-face events has become more difficult as […]

By Becky, posted in Becky's Blog

19th Oct 2016  10:06 AM  666 Reads  No comments

The PREFORMA project is running a webinar series throughout September. The final webinar focuses on veraPDF. Overview The PREFORMA project’s prototyping phase finishes at the end of 2016. The veraPDF consortium will be producing a v1.0 release candidate of the software library and applications in December 2016. This webinar demonstrates the current state of veraPDF […]

By Becky, posted in Becky's Blog

22nd Sep 2016  2:00 PM  0 Reads  No comments

Inspired by Jenny Micham’s blog post about developing her first file format signature, I thought it would be fun to take a crack at creating one myself. I previously dipped my toe into the world of contributing to PRONOM by looking at a few mis-identifications and multi-identifications, but I had yet to create a file […]

By Andrea Byrne, posted in Andrea Byrne's Blog

8th Sep 2016  9:14 AM  1085 Reads  4 Comments

Practical experiences with archiving PDF files This webinar deals with archiving PDF files. As PDF files in our repository have myriad data producers, the heterogeneity of PDF files is overwhelming. Unfortunately this means the creation of errors as well. Usually, the original data producers cannot be contacted any more, therefore we have to do the […]

By Becky, posted in Becky's Blog

20th Jul 2015  1:00 PM  0 Reads  No comments

The Open Preservation Foundation and Digital Preservation Coalition, with support from the European Commission and the PREFORMA project, invite members to a briefing day on preserving PDF at Oxford University on Wednesday 15th July 2015. PDF is a ubiquitous format for publishing and sharing digital documents. It provides a useful tool for dissemination and because […]

By Becky, posted in Becky's Blog

15th Jul 2015  10:00 AM  0 Reads  No comments

The CINES is a French centre whose one of its main missions is digital archiving. The team in charge of this task is made up of archivists, developers and files formats experts. They have developed a platform called PAC (Plateforme d’Archivage du CINES for Archiving Platform of the CINES) through which users can check their […]

By Franklin Boumda, posted in Franklin Boumda's Blog

29th Jan 2015  4:07 PM  2660 Reads  1 Comment

It is well-known that PDF documents can contain features that are preservation risks (e.g. see here and here). Migration of existing PDFs to PDF/A is sometimes advocated as a strategy for mitigating these risks. However, the benefits of this approach are often questionable, and the migration process can also be quite risky in itself. As […]

By johan, posted in johan's Blog

27th Aug 2014  3:47 PM  16101 Reads  9 Comments

This blog follows up on three earlier posts about detecting preservation risks in PDF files. In part 1 I explored to what extent the Preflight component of the Apache PDFBox library can be used to detect specific preservation risks in PDF documents. This was followed up by some work during the SPRUCE Hackathon in Leeds, […]

By johan, posted in johan's Blog

27th Jan 2014  3:08 PM  16068 Reads  7 Comments

Last winter I started a first attempt at identifying preservation risks in PDF files using the Apache Preflight PDF/A validator. This work was later followed up by others in two SPRUCE hackathons in Leeds (see this blog post by Peter Cliff) and London (described here). Much of this later work tacitly assumes that Apache Preflight […]

By johan, posted in johan's Blog

25th Jul 2013  12:57 PM  21321 Reads  12 Comments